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Integral Representations and Asymptotic 
Expansions for Closed Markovian Queueing 
Networks: Normal Usage 


By J. McKENNA and D. MITRA 
(Manuscript received October 28, 1982) 


In designing a computer system, it is vitally important to be able to 
predict the performance of the system. Often, quantities such as 
throughput, processor utilization, and response time can be predicted 
from a closed queueing network model. However, until now the 
computations involved were not feasible for large systems which call 
for models with many processing centers and many jobs distributed 
over many classes. We give a radically new approach for handling 
such large networks—an approach that begins with a representation 
of the quantities of interest as ratios of integrals. These integrals 
contain a large parameter reflecting the size of the network. Next, 
expansions of the integrals in inverse powers of this large parameter 
are derived. For cases in which the number of processing centers is 
greater than one, this ts the only technique we know of that yields the 
complete asymptotic expansion. Our method for computing the terms 
of the expansion can be interpreted as decomposing the original 
network into a large number of small “pseudonetworks.” Our tech- 
nique also yields easily computed error bounds when only the first 
few terms of the expansion are used. This method has been imple- 
mented in a software package with which we can analyze systems 
larger by several orders of magnitude than was previously possible. 


l. INTRODUCTION 


Closed Markovian queueing networks, which are tractable in having 


661 


the product form (or separability) in their stationary distribution, 
continue to have a profound influence on computer communication, 
computer systems analysis, and traffic theory."* The closed networks 
have been used to model multiple-resource computer systems,” 
multiprogrammed computer systems,”*® time-sharing,” and window 
flow control in computer communication networks;”’” networks with 
blocking’” require the analysis of a large number of closed networks. 
Not surprisingly, considerable effort has gone into devising efficient 
procedures for computing the partition function,’*'’ an element of the 
product form solution requiring significant computation. More re- 
cently, mainly spurred by parallel technological development in com- 
puter communication, there has been a focusing of effort on large 
closed networks!""”'**! with many classes of jobs and transactions and 
large populations in each class. The point of departure of this effort is 
the realization that the earlier recursive techniques for computing the 
partition function are severely limited in terms of computing time, 
memory storage, and attainable accuracy when it comes to the large 
networks presently demanding analysis. 

In an earlier paper,’ we introduced a new approach to calculating 
the partition function. We showed there that the partition function 
could be represented as an integral containing a large parameter which, 
in some sense, reflected the large size of the network. In general, the 
partition function is represented by a multiple integral. However, in 
the special case where there is only one node at which queueing can 
occur (a node of type 1, 2, or 4 in the terminology of Ref. 4), the 
partition function is represented by a single integral. In Ref. 21 we 
applied standard theory to obtain asymptotic expansions of the integral 
representation. The standard techniques, however, cannot be extended 
directly to multiple integrals. In this paper, we make use of the special 
properties of our integral representation and obtain a method for 
generating asymptotic expansions of our integrals which works equally 
well for single or multiple integrals. For our single integrals, the 
techniques developed in this paper are easier to apply than the 
standard techniques we used in Ref. 21. For our multiple integrals, our 
technique is the only one we know of which can be used to obtain 
higher order terms in the asymptotic expansion. 

The computational effort of solving a large network with p classes 
and with populations in each class of the order of magnitude of 100 is 
here reduced to be roughly as complex as solving by older techniques 
for the partition functions of (p choose 4) networks where each of 
these networks has a total population of, at most, seven allocated over, 
at most, four classes. Thus, we have reduced the problem to the 
solution of a large number of small problems. One consequence is that 
for p large enough, even our technique will be computationally intrac- 
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table. Our results nevertheless allow large, previously intractable net- 
works to be solved with vastly reduced computational effort and with 
great accuracy. For example, a network with 20 classes, each class 
having a population of 100, can be handled easily by our technique. 
Two noteworthy aspects of this work are (z) the built-in notion of 
depth of accuracy: by computing more terms of the asymptotic expan- 
sion, it is possible to match computational effort to desired degree of 
accuracy, and (iz) a comprehensive error analysis that allows estimates 
to be accompanied by sharp error bounds with little incremental effort. 
Let us elaborate. 

Section III recapitulates and extends the results in Ref. 21 to obtain 
representations as integrals of the partition function of most, but not 
all, closed product-form networks.* The class-by-class breakdown of 
the utilization of each processor, itself simply related to mean response 
time and throughput, is given in terms of a ratio of two integrals. 
These are multiple integrals with multiplicity equal to the number of 
centers in the network where queues may form. 

The asymptotic expansions are in powers of (1/N) where WN is a 
parameter designed to reflect network size. Our computational expe- 
rience has been that five terms in the mean value expansions are 
generally adequate both for large networks, for which our asymptotic 
technique is particularly well suited, and, to a surprising extent, for 
small networks as well.” Section IV gives the procedure for generating 
the general coefficient of the expansion, while the leading coefficients 
are explicitly derived. 

A remarkable feature of the composition of the coefficients make 
their computation amenable to various techniques. As shown in Sec- 
tion 4.4, the coefficients turn out to be very simply related to the 
partition function of a certain hypothetical network which we call the 
pseudonetwork. The topology is related but not identical to that of the 
given network and the processing rates are quite different. Most 
importantly, however, to compute the leading expansion coefficients, 
it is necessary to compute the partition function for only small popu- 
lations in the pseudonetwork. Thus, to compute five terms of the 
utilization expansion, we need only consider the total population over 
all classes to be at most seven in the pseudonetwork. Small population 
in the pseudonetwork has the consequence of its partition function 
being solvable by existing recursive techniques of proven efficacy. 

Section V proves that the series in (1/N) given in Section IV is 
endowed with properties substantially more desirable than those pos- 
sessed by asymptotic expansions in general. It is shown in Section 5.2 
that the truncation error is numerically less than the first neglected 
term of the expansion, and has the same sign. Thus, except for the 
effort in computing an additional term, all that is generally needed for 
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an error analysis is already available in the basic series that is com- 
puted. 

We note here that while he mathematical literature on single 
integrals is extensive,” th re is little on asymptotic expansions of 
multiple integrals. Two notable investigations along lines different 
from here are Bleistein’s” and Skinner’s.”° Both are marked by extreme 
complexity. Bleistein gives the leading coefficient, while Skinner ob- 
tains the second term. Both terms are quite complicated. 

Let us now elaborate on some limitations of the paper. An important 
conclusion of Ref. 21 is that qualitatively different expansions of the 
integrals exist depending on whether usage is “normal,” “high,” or 
“very high.” This is even more true in the present context of multiple 
processing centers. Therefore, this paper is devoted exclusively to the 
case of normal usage. We propose to consider the remaining usage 
conditions in the future. Exactly what is meant by normal usage is 
explained in Section 4.1. 

The paper also assumes that, for each class of jobs, the routing 
through the network contains at least one infinite server (1S) center. It 
turns out that for networks in which this is not true, the asymptotic 
expansions are more appropriately derived in the context of either 
high or very high usage conditions. However, certain basic results on 
the integral representations of partition functions and mean values are 
derived in Section III regardless of whether Is centers exist for all 
classes. This paper does not allow load dependent service rates in the 
first-come-first-serve centers. 

The results in this paper have been incorporated in a large software 
package and this will be reported elsewhere. No results on moments of 
individual queue lengths are given here. However, their integral rep- 
resentations and asymptotic expansions are very similar,”’ and the 
details will be published elsewhere. 


li. PRODUCT FORM IN STATIONARY DISTRIBUTIONS: PRELIMINARIES 


2.1 Product form 


We recapitulate some of the well-known results* concerning product 
form in stochastic networks and present them in the form that will be 
used later. 

Let p be the number of classes of jobs and reserve the symbol 7 for 
indexing class. Hence, when the index for summation or multiplication 
is omitted, it is understood that the missing index is 7, where 1 =] = 
p. A total of s service centers are allowed. We will find it natural to 
distinguish the centers of types 1, 2, and 4 which have queueing from 
the remaining centers of type 3 which do not. (The definition of type 
1 to 4 centers is given in Ref. 4.) Thus, centers 1 through g will be the 
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queueing centers, while (g +:1) through s will be the type 3 centers, 
which have also been called think nodes and Is nodes. We reserve the 
symbol i for indexing centers. Also, whenever class and center indices 
appear together, the first always refers to class. 

Let the equilibrium probability of finding n;; jobs of class 7 at center 
jl<j=p,1si<s,bem(yi, yo, +--+, ys), where 


yi A (Mu, Nai, +++ , Npi), l=i<s. (1) 


Closed networks are characterized by conservation of jobs in each 
class. That is, the population of jobs of the jth class is constant at K;, 
say. The well-known results on closed networks with the product form 
in its stationary distribution may be given in the following form: 


1 Ss 
T(Y1,°**,Ys) = G U Tilyi), (2) 


where 


‘fi 
t 


wil Yi) =, (y) ni) TI (2) ’ 1s t= qd; 


nji! 


-11(%). (q+1)<Siss. (3) 


nji! 


In the above formulas, we have taken into account the previously 
stated assumption; namely, for the first-come-first-served discipline in 
type 1 centers, the service rate is independent of the number of jobs in 
queue. Also, in (3), 


expected number of visits of class 7 jobs to center z 
Pyi a ——<— en ee ee (4) 
service rate of class 7 jobs in center z 
where the numerator is obtained from the given routing matrix by 
solving for the eigenvector corresponding to the eigenvalue at 1. 


In (2), Gis the partition function, and it is explicitly 


GK)= Yo. YS Taya, (5) 


1’n, =A) 1n,=Ki i=1 


where we have written 1’n; for )j-1 nj; and the condition 1’n; = 4; to 
indicate the conservation of jobs in each class. Thus, 


aw = 5-3 / fon] Test]. @ 


; n Gi 


i= i=qt+ 


In Section III, we will refer to this expression for the partition function. 
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2.2 Asymptotic expansions 


A series 
z A,/N* 
is said to be an asymptotic expansion” of a function J(N) if 
I(N) —F As/NE = 00N™ as N— © (7) 
for every m = 1, 2, --- . We write 


I(N) ~ ¥, Ax/N*. 
k=0 
The series may be either convergent or divergent. 


lil. INTEGRAL REPRESENTATIONS 


As the representations presented here are basic to the subsequent 
development, we have allowed some duplication in Section 3.1 with 
Section 10 of Reference 21. 


3.1 Partition function 


We start with Euler’s integral representation for the factorial, 


n} -| e “u"du. (8) 
0 


Returning to (6), we use this representation to write 
()) nji)! ~ | ee II u;"'dui, i= 1, 2. coe yd. (9) 
0 


Substitution in (6) gives 


a= [of em(- 34) 2. 25 [Herd 


2 "i 
| TT ft mal du; +++ dug. (10) 
i=q Jie 


Now by the multinomial theorem, 


conn [oo(-S) 


Kj 
‘TI {Z Pyili + y ea} du; +:- dug. (11) 





i=q+1 
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It is noteworthy but not surprising that the parameters p;; for all the 
type 3 centers appear lumped together. Hence, we may simplify the 
notation by introducing pj, where 


Pjo = y Pyis J=1,2, +++, p. (12) 
i=qg+1 


Another consequence of the notation is that the center index i may 
henceforth be understood to range over the processing centers only, 
le, lLSisg. 

The new quantity pj has the physical significance of being the 
weighted combination of all the mean think times of the Is centers in 
the routing of the jth class. In particular, if the routing of the jth class 
contains at least one Is center, then pj. > 0 and otherwise p,jo = 0. Let 
I be the collection of indices of classes of the former type and let J* be 
the complementary collection, i.e., 


JETS po >0 and JEL* S pp = 0. (13) 
With this notation, 


o=[noi/nxa] [few eu 


K 
x [] {3 +p Maul II {3 paws} du, --+ dug. (14) 


jel jeEl* 


In vector notation, which we shall use widely, this reduces to 
c=] of / TL: | | e "TT (67+ ru) (15) 
Q+ I 


u= (uw, U2, °°°, Ug)’ 


1L=(1,1,---, 1)’ 


where* 


— , 7 
Yj = (ry, M2, °+* 5 Tig)’; l<j=p 


ri = p/p We FET 


=pi if JEl* 
5r=1 if fel 
=0 if ,El* 
Q* = {ulu=o}. (16) 


* Unfortunately, 7;; here is defined to be the reciprocal of the natural extension of r; 
in Ref. 21. 
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We now introduce the large parameter N and define 
BA K/N, 1578p, (17) 
T; A Ny,, l=/J=p. (18) 


The suggestion in the notation is that in the generic large network 
{6;} and {I\;} are 0(1). That is, the ratio of processing time to think 
time is, in order of magnitude estimation, proportionately less for 
increased populations. There is great latitude in the choice of the large 
parameter. The guiding principle in choosing it should be that the 
resulting values of { 8;} and {Tj;} are as uniformly close to 1 as possible. 
In practice, we have used 


1 
N = max {=}. (19) 
y Vj 


On substituting (17) and (18) into (15) and after the change of variables 
z =u/N, we obtain from (15) another useful integral representation of 
the partition function which is distinguished by its dependence on N. 
Summarizing for future reference, we have 


Proposition 1: 


J 


G(K) = i pid / Il x | et" T] (67+ rju)du (20) 
jel J Qt j 


= a Il of / ral | e V2 dz, (21) 
jel J Qt 
where 
P 
f(z) 4 V’z— > Blog(6;7 + 1';z). (22) 
k=1 
= 


3.2 Mean values 


We restrict our attention to the mean value u,;(K) which gives the 
utilization of the zth processor by jobs of the oth class for a population 
distribution by class in the network denoted by K = (Ki, Ko, --- , K,). 
Other interesting mean performance indices, such as throughput and 
mean response time are known to be simply related to {u,;(K)} and 
the interested reader may consult Ref. 17. 

In Ref. 17, 


G(K _ e,) 


Ugi(K) = Poi GK)” 


lsosp,1Sig<q, (23) 


where e, is our notation for the vector with the oth component unity 
and all other components zero. Thus, the value for the partition 
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function is needed for the given population distribution and also for 
the population in the oth class reduced by 1. 
Now from (20) 


K. 
Po0 I P76 
= JE —l'u ’ 
G(K + e,) Te Dink FDI KE I. e “(1+ riu) 
x al (dj7 + rju) ‘du if oE€l (24) 
J 
K. 
II D6 
= ee e T(r?) 
(K, + 1) [] Ki! Jo. : 
x Il (Oj. + r’u)du if oE€l*. (25) 


J 


From (23) to (25) and the same change of variables, namely z = u/N, 
employed in transforming (20) to (21) we obtain the following repre- 
sentation of the utilization in terms of integrals, 


Proposition 2: For class index o, 1 = o S p, and center index i, 
lsi<q, 


| (I’.z)e! dz 


1 
w(K + e,)7! = {_——_____ ee ape vee, AF 
Uai(K + e,) [aE oa j 2) 
e MB dy 
CL] 


Some digressionary comments are as follows. Note that the above 
conceals that r,; is normalized differently, but not unexpectedly, de- 
pending on whether o € J or otherwise [see (16) ]. Also note that in the 
typical large network for normal operating conditions we always expect 
r,i to be 0(1/N), precisely because of the normalization used, and K, to 
be O(N) so that the term in braces in (26) is then O(1). 


IV. ASYMPTOTIC EXPANSIONS 


We henceforth consider only networks in which the route for each 
class always contains an infinite server center. Specifically, 


pjo > 9, J=1,2,--+,pD, (27) 
and the set [* is empty. 


4.1 The assumption of “‘normal usage’”’ 
Define 
aS1-Y Br; (28) 
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so that in terms of the original network parameters 


gear VK peice cg. (29) 
Po 
It is important to note that a is independent of the choice of N. The 
parameter a;(—% < a; < 1) is an indicator of the unutilized processing 
capability of the zth center. Positive values of a; correspond to less 
than 100 percent utilization of the processor and negative values which, 
of course, can occur to very high utilizations.* Normal usage in large 
networks will almost certainly require a; > 0, and in all likelihood a; 
will not be close to 0 for all z. 
Hereafter, we assume 


a>0, i=1,2,-+-,4, (30) 


which condicion we refer to as normal usage. Moreover, as a; for some 
t comes close to 0, the expansions given here are not as efficient as 
those derived specifically for such conditions and which we propose to 
give in the future. 

A justification of the usage interpretation that we have given to a is 
provided by a result obtained later (see below Proposition 6) which 
states that, asymptotic with network size, 


u; = utilization of ith processor ~ 1 — a;. (31) 


An obvious caveat is that this result is derived for the assumption in 
(30). However, as the utilization in (31) can come close to unity even 
while (30) is satisfied, (31) suggests that for large networks normal 
usage will not extend beyond the range a > 0. 

Observe that 


f(0) =0 (32) 
and Vf(z)=1- 2 i aan ry I; (33) 

so that 
a = Vf(0). (34) 


The assumption of a > 0 and the form in (33) ensures that the function 
f has no stationary points in Q* since 


Vf(z) = VfF(O) > 0, ZEQ*. (35) 
Also observe that 


* Unfortunately, a here has an opposite sign from the natural extension of a as 
defined in Ref. 21. 
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af oe By TY. 
Ie] =2 (1 + I¥z)* ee 28) 


from which we note that the Hessian is positive semidefinite. 

To conclude, with normal usage, f is a convex function, with its 
minimum in Q* attained at 0 and with no point in Q* where its 
gradient vanishes. 


4.2 Transformations on integrals exploiting normal usage 


Consider the following transformations on the basic integral: 


| o-N@ dg = | o-Nf@+NYB(L2)-NYB Liz) ty 
= | e "exp 
— vy B{Tyz — log(1 + Piz) fa 
J 


= N~? | e-*“exp 


_ Ly pj T in — N tog( 1 + x ra) Ja, (37) 


where u = Nz. Now make the following change of variables, 





U; A ajli, lsisq (38) 
and normalize the system parameters with respect to a, thus, 
T,ATji/o, lsjsp,l1sisgq. (39) 
Observe that in particular 
lu = Dv. (40) 
From (37), 
NY 
e N@Ad7 = e !YH(N", v)dv, (41) 
: (IIa) Ae 
where 
H(N™, v) & e8N™, (42) 
Pp a 1 a 
s(N“,v) 4-3 Byte —N log(1 + N rw) (43) 
j=l 


Our notation here suggesting N~’ as the independent variable may be 
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perplexing at this stage, but it provides a clue to the direction of the 
analysis. 

Here are two further digressionary comments: the above transfor- 
mations are meaningful only in the context of normal usage, i.e., a > 
0. In a similar vein, an interpretation of {I';} as renormalized {T;i} is 
only meaningful for a > 0. It is noteworthy that hereafter we shall be 
dealing exclusively with {Tj} and not at all with {1}. 

We need to repeat the transformations given above for the integral 
f (I.z)e~! dz. The result may be combined with (42) and (43) to give 
the following compact representation: 


—(q+m) 


—l'v (Tv ..\m 24 
(Ta) Jo. ° (Iv)"H(N~, v)dv, (44) 


Qt 
m=0,1,2,++-. LJ 


(Iz)"e dz, = 
Qt 





We may now use these expressions in Proposition 2 to obtain for 1 <= 
oxpand1=i1<=q, 


5 nae = Po 
Usi(K + e,) ioe rs " 


| e (Iv) H(N7, v)dv 

i Pe 

je a se err ee (am 

Nr (45) 
| e 'YH(N, v)dv 


Note that the bracketed term is independent of the processing center 
index 1. 
Let us agree to call 


I(N) | e-'YH(N™", v)dv (46) 


and 


I (N) 3 | e (Iv) H(N“, v)dv, (47) 


where the superscript (1) is a mnemonic for first moment. In this 
notation, we have for future reference 


Proposition 3: 


: -1 _ Po0 1 IS)(N) 
Ugi(K + e,) = ae oaH lt + io P (48) 


The asymptotic expansions of J(N) and I‘)(N) are considered next. 
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4.3 Asymptotic expansions 


This section will outline the procedure for obtaining the asymptotic 
expansions together with plausibility arguments. The proofs of the 
assertions concerning asymptoticity will follow in Section V. Moreover, 
we defer till Section 4.4 certain observations which make feasible the 
efficient computation of the coefficients of the asymptotic expansions. 

Our procedure for J(N) is to first obtain a power series in N° of 
H(N~™, v) and then to integrate term by term. Thus, we let 


~ Ar(v) 








H(N™ = 
(N™, v) Py ne (49) 
and 
A, & | e 'Yh,(v)dv (50) 
Qt 
and claim that 
oo Ap 
I(N) ~ ¥ —=. 51 
2» ne (51) 
Let us elaborate on the coefficients {h;(v)} in (49). 
1 @& 
h = —______ (0) =0,1,2,-.--. 2 
2(V) Rh a(l/N)* ( ’ v), k ’ i, ’ (5 ) 
To make these explicit, we need to first introduce 
(—1)* Tyv._\k 
fi(v) A i pa B(T NV)", kR=1,2,---. (53) 
J 
Their role becomes clear if we recall (42): 
H(N™, v) =e), (54) 


and note from (48) that for fixed v € Q*, s(N~’, v) and, hence, 
H(N™, v) are functions of N~', analytic in Re(N~') > e(v), where 
e(v) < 0. Then, 


s(0,v) = —Rlfiai(v), kR=1,2,---. (55) 


To proceed now to the derivatives of H(-, v) itself, we will find useful 
the following expression in which it is understood that all derivatives 
are with respect to N7: 


k 

H**\(N-, v) = ¥ (Asura, v)H'™(N~, v), 
m=0 

k=0,1,+:-. (56) 
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From (52), (55) and the above, it is easy to see that there exists the 
following recursive scheme for generating {h;(v)}: 


ho(v) =1 
k 


rar 2, (k + 1 — m) fr+2-m(V)Am(V), 


-k=0,1,2, ++: (57) 


Arvi(v) = — 


In particular, the leading elements are 
h,(v) — 1, k = 0 
= — fo(v), k=1 


2 


1 
=—filv) +5 filv), A 


1 
= —f,(v) + Al(v)fa(v) — re f2(v), k=3. (58) 


To summarize the steps discussed so far in the generation of the 
asymptotic expansion, we have 


Proposition 4: 


roe) A, 
KN) ~ 3 5 
where A; = fe~*‘h,(v)dv, {hi(v)} is obtained recursively from (57) 
with leading elements exhibited in (58), and { f,(v)} is asin (53). OO 

The aspect of the above asymptotic expansion of the integral J(N), 
which consists of decomposing the integrand into the product of an 
exponential and a function, the expansion of the latter function in a 
power series, and the final term-by-term integration is like the proce- 
dure which, in the context of single integrals, is justified by Watson’s 
Lemma” under certain conditions. Our contribution has been to show 
that a generalization of this fundamental result exists for the multiple 
integrals of interest in stochastic networks. 

Our procedure for obtaining the asymptotic expansion for IS) (NV) is 
very similar and consists of obtaining a power series in N™ of 
(I’.v)H(N~',v) and integrating term by term. However, we notice the 
simplifying fact that 


(59) 


(vy HN -5 oe (60) 


Thus, the procedure for this integral is as follows: 
Proposition 38: 
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oo A‘ 
T)(N) ~ py WE? (61) 
where 
AM A | e *(I.v)h(v)dv, k=0,1,2,---. (62) 
, O 


As observed in Ref. 21, the asymptotic expansion for che integrals may 
be used to generate asymptotic expansions for their ratios on account 
of powers of N~* forming a multiplicative sequence.” Thus, the coef- 
ficients of an asymptotic expansion for u,;(K + e,)~' may be obtained 
from formal substitution in Proposition 2 of the expansions in Propo- 
sitions 4 and 5. This gives 


Proposition 6: 





Poi( Ko ate ah -] 1 OD 2 fr 
—————— }u,i(K + e, ~i1+— —,, 63 
| = (K + e,) 5m (63) 
Age 
1 a 
= |AR- Y AmBorm|, B= % +++. O 
0 m=1 


With the above proposition we may generate (k + 1) terms of the 
expansion for u,; from k& terms of I(N) and IS)(N). 
An immediate corollary to the above proposition is 
Pike 


Ugi(K) ~ Deon 


and 


u;(K) = utilization of 7th processor = ¥\ u;;(K) 
J 


La; 1=1,2,--+,q, (65) 


which was claimed earlier in Section 4.1 in the course of giving physical 
meaning to the parameters {a;}. 

This corollary illustrates the important point that the terms in the 
asymptotic expansions A;/N*, A‘;/N*, and B,,/N**! are all inde- 
pendent of N and depend only on the network parameters. The dummy 
parameter N serves to show how to group the terms of the same 
magnitude. This independence from N follows from the fact that a is 
independent of N, as noted earlier, and from the fact f,(v)/N*™ is 
easily seen to be independent of N. Once this is noted, the result 
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follows easily from the definitions of the terms. The choice of N is 
important numerically, which will be discussed in a subsequent paper. 


4.4 Pseudonetworks and the computation of expansion coefficients 


Here, we consider the compositions of coefficients, {A,} and {A ,;, oy 


and find that quite remarkably they are related intimately to the 
partition function of a certain hypothetical network which we call the 
pseudonetwork. It turns out that to compute the leading elements of 
{A;} and {Avy z}, we need to consider the pseudonetwork with only 
small populations. Thus, existing techniques known to be effective for 
computing partition functions for small populations may be used to 
compute the leading coefficients of our asymptotic expansions. 

An example will prove useful. From Proposition 4 and eq. (58) we 
see that 


A3 = | om — fiw + fo(v) fa(v) — = Fi fa. (66) 


Now consider only the third term after denoting it by A33. Using the 
expression for a as given by (53), we obtain 


Apis a> B: | e!* (Fv) *dy 


~ = y BB. | el (Iv)4(Iv)*dv 
J#k 


ee » BBB | e*(Tyv)*(Tv)*(Piv)’dv, (67) 
48 JAkAl zy 

where the subscripts /, k, and / are class indices ranging over [1, p]. We 
now make the observation that the generic integral in the composition 
of the asymptotic expansion coefficients is within a multiplicative 
constant of 


eT] (u)™du. (68) 


ane 
(11m) Bs j 


The above is an important form for we may now identify it with 
quantities previously encountered. 
We first give the following equivalent expression for g(m). 


Bi): Die Pe It { (m0): 1. (68’) 


V’ni=™) Vnp=mp t=1 J 


g(m) = g(mi, Mea, ***, Mp) 4 


Recall the expression in Proposition 1, eq. (20), for the integral repre- 
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sentation of the partition function. Specialize the expression there to 
a network with no infinite server centers, i.e., set J is empty, and find 
that it reduces to 


G(K) = = | os at (rju)du. (69) 
(1 x) + Jj 


On comparing (68) and (69), or (68’) and (6), we may conclude that 
g(m) is the partition function of a certain network. 

Call this hypothetical network the pseudonetwork. What character- 
izes the pseudonetwork? To begin with, it is closed and lacks Is centers. 
There are, as in the original network, exactly g processing centers and 
p classes of jobs. The processing rate of jobs from the jth class in the 
ith center of the pseudonetwork is Ti, where, you will recall, lj; = 
I;/a;. In agreement with past convention, (771, m2, -++ , mp)’ denotes 
in vector form the population distribution by class in the network. 

We may follow the procedure outlined in the example concerning 
A33 above to express all the leading coefficients A;, k = 0, 1, 2, 3 in 
terms of the partition function of the pseudonetwork. This gives 


1 
Az = 2) Big (3e,;) + 3 Y Bj g(4e,) + 5 & Bibis (2e; + 2en) 
j j jh 
A3 = —6 > Bg (4e;) — 20 > Bj g(5e,) — 15 py B; g (6e;) 
J J J 
— 2) B Beg (2e; + 3e:) 
J#k 


—3 ¥ B7Brg(4e; + 2ex) 
J#k 


1 
—~ Y BBeBig(2e; + 2e, + 2e)). (70) 


6 jhptpj 


In these expressions, j, k, and / are class indices each with range 
[1, p]. 

Notice that in the computation of A;, k = 0, 1, 2, 3, the population 
distribution m that appears in g(m) may be characterized as being 
quite small. The total population in the pseudonetwork over all classes, 
)} mj, 1s at most 6. A further simplifying condition is that we need 
consider only distributions with three classes at most with nonzero 
populations—the extreme distribution arises in g(2e; + 2e, + 2e;). 

Since we are interested in population distributions where all but a 
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small number of classes have no members at all, we may equivalently 
choose to view the pseudonetwork as a collection of smaller networks 
each with a small (less than p) number of classes of jobs. For example, 
g(2e; + 3e,) may be viewed either as the partition function of the 
pseudonetwork in which all but classes 7 and k have zero population, 
or as originating from a network with only two classes with population 
distribution (2, 3) but one which is otherwise unchanged. Such distinc- 
tions, while not material to the procedures given here, may be conse- 
quential in the efficiency of the computations. 

The coefficients {A‘}} of the expansion for I$)(N) may also be 
expressed in terms of the partition function of the pseudonetwork. As 
the derivation is similar, it will suffice to give the results, which we do 
in the Appendix. 

It will be observed that to compute {A“"}}, k = 0, 1, 2, 3 we need to 
consider various allocations to classes of a total population in the 
pseudonetwork of at most 7. Thus, to compute the leading five terms 
of the utilization u,;, we need A, and {A‘2} for k = 0, 1, 2, 3, and these 
are computed from the values of the partition function of the pseudo- 
network for various allocations to classes of a total network population 
of at most 7. This is an elaboration of a claim made in the Introduction. 


V. ERROR ANALYSIS AND PROOF OF ASYMPTOTICITY 


Equations (59) and (61) contain claims requiring proof on the 
asymptoticity of the expansions of J(N) and I“ (N). This is provided 
here as a corollary to a complete error analysis. In fact we show that 
the expansions given earlier have properties more attractive than that 
required of asymptotic expansions. For instance, asymptoticity re- 
quires that errors incurred in the estimation of the integrals from the 
use of, say, m leading terms is of the same order as the (m + 1)th term 
as N — o, We prove that the expansions derived have the stronger 
property that the truncation error is bounded by the (m + 1)th term. 
The practical benefit of this analysis is that with very little incremental 
effort we can accompany our estimates of the mean values with sharp 
estimates of the estimation error. 


5.1 Completely monotonic functions 2+ 


We need the following definition: for any nonnegative R, let Q*(R) 
be the set of nonnegative vectors with norm bounded by R, i.e., 
Q*(R) = {v|v 20 and || v|| <= 2}. Note that Q*(R) — Q* asR> ~. 

The proposition below (cf. Ref. 21) states a remarkable property of 
the function H(N™, v) which is a key to much of the error analysis. 

Proposition 7: For all v € Q*(R), R < «, H(N™, v) is a completely 
monotonic (or alternating) function of N~’. That is, 
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a” for k=0,1,2,--- 


a) ee a 
CUO sainpe >) =° ond O<NI<o, 
Proof: Consider the form for H(N~, v) from (42 to 43), Section 4.2: 
ee ca 
H(N™, v) = I] e Bi Ej) (1 + N tv) : (72) 
J 


Because products of completely monotonic functions are also com- 
pletely monotonic, it suffices to show that 


ie 
(1 + ee 


is a completely monotonic function of N~*. Let us write 


ore 
(1+ 2 tye) = a 
where 
t(w) = Pog + aw), (74) 


by identifying w = 1/N and a =I‘v. We note that 0 < w < ~ and that 
a is nonnegative and bounded, with the latter property being ensured 
by the restriction of v to Q*(R), R < o. Thus, all derivatives of 
log(1 + aw), and consequently of ¢(w), exist and are continuous for 
0= w<o, Since 


d?*{etwn k kh qkti-m d™ ” 
HT mn) Yaa df | gm FF |, (75) 


we may conclude from a simple inductive argument that 


if ¢(w) is acompletely monotonic function of w, 


t(w) 


then e*’”’ is a completely monotonic function of w. (76) 


Finally, to show that ¢(w) is a completely monotonic function of w is 
to show that {log(1 + x)}/x is completely monotonic. This is true, but 
we omit the proof. L) 


We need the following analogous property in connection with the 
integrand of I°’(N): 


For all vEQ*(R), R<~x, (Mv)H(N“, v) is completely 
monotonic in N~. (77) 


The proof is immediate from the preceding proposition since the 
additional factor (['v) does not depend on N~™. 
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5.2 Error bounds 
Proposition 8: For all N,O< N=x0o 


A m-1 Ak 
— — —_—- <r — eee 
yn < TIN) x ESO «=m =1,3,5, 
m-—-1 Ap As 
O<1(N) — bye Sym m = 2, 4,6, +++. (78) 


Proof: We initially require v € Q*(R), R < © so that the preceding 
proposition is applicable. Viewing H(N~', v) as a function of N~', we 
may use a version of Taylor’s theorem™ that is accompanied by an 
estimate of the truncation error for the series to obtain 

™) hiv) 1 1 a” 


H(N™,v) = © 


LN tN mare (79) 


where 
fe [0,N~’]. 


Consider first the case of m odd. From the preceding proposition, the 
mth derivative of H(N™, v) is a nonpositive and monotonically 
nondecreasing function of N~’. Hence, 


o”™ m 








Spee errands Se ; 
a Ny* 2 Y) Ssqiupe H& Vv) = 0, E> 0. (80) 
Substituting in (79), 
Am(v) aj m™* hiv) 
Ne < H(N™,v) a NE <0. (81) 
Hence, 
Ce ; 
| e 'YH(N',v)dv— ¥ mal e'Yh,(v)dv 
Q*(R) ica Qt(R) 
= 0 
1 7 
= Nia c ‘*hm(v)dv é (82) 
Qt(R) 


The pair of bounds holds uniformly in R. Consequently, we may let 
R-— o and drop the restriction on R to obtain (78). 

The proof for m even is very similar with the starting point being 
the following replacement for (80), 


Qo” m 


0 
0 = aqN)* a ) = Sqn)" 


(0, v). (83) 


LI 
The rest of the proof is omitted. 
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The above proposition states that the error incurred from using only 
a certain number of leading terms of the expansion for J(N) is numer- 
ically less than the first neglected term of the series and has the same 
sign. 

Another implication that can be quite useful in practice is that the 
estimate with an odd number of terms is an upper bound on the 
integral and an even number of terms gives a lower bound. Thus, the 
error sequence alternates in sign. (It is also true but less consequential 
that the terms of the expansion also alternate in sign.) 

In particular the above proposition proves the asymptoticity of the 
expansion 1n Section 4.3. 

By a matching argument and with recourse to the complete mono- 
tonicity of (I'v) H(N7 , Vv), see (77), we also have 


Proposition 9: For any class index o and allN,O= N=, 











AY. m—-1 A> 
a <][(N) - 3 we < 0, m= 1,3,5,--- 
k=0 
m-1 AY — Adam 
0<I$(N) - Py WES qym? = 24,6, 02+. (84) 


With error estimates available for both J(N) and I9)(N), it is 
straightforward to use these to obtain an error estimate for the mean 
value given in Proposition 3, eq. (48). 


APPENDIX A 


The Coefficients {A [)}in Terms of the Partition Function of the Pseudo- 
Network 


Here o 1s a given fixed class index, while 7, k, and / are also class 
indices each ranging over [1, p]. It is also understood that 7, k, Z, and 
o are all distinct. 


Ags = g (es) 
Agi = —3B.g(3e.) — Y Big (es + 2¢;) 
J 


A’ = 88,g(4e.) + 15828 (5e.) 
+ ¥' [26g (3e; + e.) + 38; g(4e; + €.) + 38.B)g (3. + 2e,)] 
Jj 


+ > B; Bre (2e; + 2er, + e,). 
Ags = —30B,g (5e.) — 120B82g (6e,) — 10583 g(7e.) 
= y [68g (4e; + e,) + 2087 g(e, + 5e;) + 6B.h;g(3e, + 3e;) 
+ 26,8; g (de, + 2e;) 


QUEUEING NETWORKS _ 681 


+ 158} g(e. + Ge) + 15828; g (5e, + 2e) + 98.87 g(8e,+ 4e,)] 


=); | 28,66 (e, + 2e; + 3e,) + 387 Brg(e, + 4e; + 2e,) 
TR 


3 
+ 5 BoB; Beg (8e. + 2e; + 20) 


1 
ie >) BiBrBig(2e; + 2er + 2e; + e,). 
Dk 
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Parasitic Insensitive, Biphase Switched 
Capacitor Filters Realized With One 
Operational Amplifier Per Pole Pair 


By K. R. LAKER, P: E. FLEISCHER, and A. GANESAN 
(Manuscript received October 9, 1981) 


Practical techniques are given for reducing the number of opera- 
tional amplifiers (op-amps) in switched capacitor filters. Op-amp 
count is typically reduced to one op-amp per pole pair, while main- 
taining the insensitivity to top and bottom plate parasitics heretofore 
associated with one-op-amp-per-pole structures. These techniques are 
used to develop a parasitic insensitive single amplifier resonator and 
a general single amplifier biquad. Next, complete design procedures 
are given for these circuits. Finally, the same methods are applied to 
leapfrog structures, where similar op-amp savings are demonstrated. 


l. INTRODUCTION 


The use of active switched capacitor (sc) filters’? as constituents in 
large-scale integrated (Ls1) subsystems*”” has been rapidly expanding. 
Crucial to the realization of manufacturable metal-oxide-semiconduc- 
tor (mos) sc filters has been the development of parasitic insensitive 
sc networks.*'’”” As a consequence, it has become a commonly ac- 
cepted notion that insensitivity to both top and bottom plate parasitic 
capacitances requires realization with one op-amp per pole (i.e., two 
op-amps per pole pair). However, since op-amps consume power, 
represent about 20 percent of a filter’s die area, and are sources of 
noise and power supply feed, it is useful to consider techniques that 
reduce the number of op-amps required to implement a given transfer 
function. The purpose of this paper is to introduce practical techniques 
for achieving this, while retaining the crucial parasitic insensitivities 
mentioned previously. A straightforward technique for reducing op- 
amp count is to time share or multiplex’*"* each op-amp among two or 
more storage capacitors. A problem with this approach is that each 
Op-amp operates without feedback during the dead zones of the non- 
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overlapping clocks and can drift to one of the supply rails. To remedy 
this problem, a simple sc feedback circuit is introduced which ensures 
that closed loop stability is maintained at all times. In addition to the 
stability problem, multiplexing within a single filter tends to result in 
a parasitic-sensitive implementation.’* This is because many opera- 
tions within such a filter involve sensing voltages during a clock phase 
and transferring a proportional charge during a latter phase. In the 
past, when inverting transfers of this nature were needed, the only 
recourse has been to use the parasitic-sensitive “toggle” switched 
capacitor. However, using the parasitic-compensated integrators’””® 
introduced by the authors, sc filters involving multiplexed op-amps 
can be realized which are insensitive to parasitics. In this paper, we 
show how additional features provided by these new integrator reall- 
zations enhance the design flexibility available with multiplexed op- 
amp sc structures. In addition, we develop practical, parasitic-insen- 
sitive realizations for single-amplifier biquads and N/2 op-amp Nth 
order* leapfrog structures. The op-amp-reduced circuits are derived 
step-by-step from conventional one-op-amp-per-pole prototypes. 


ll. NEW PARASITIC-COMPENSATED SC INTEGRATOR REALIZATIONS 


Prior to developing the single-amplifier biquad, let us review the 
parasitic-compensated integrators’”’® mentioned in the previous sec- 
tion. Consider first the integrator in Fig. la with the relevant parasitic 
capacitances at nodes I and 2 denoted as Cp: and Cypo, respectively. To 
establish the conditions for parasitic insensitivity, let us first determine 
the voltage across the holding capacitor at node 2 during the even (e) 
time slot. From Fig. 1b it follows that 


2C 
ee rc 1429 1 
Me ICC CR (1) 


Hence, the net charge transferred to the integrating capacitor D, 
during the odd(o) time slot (obtained from the circuit in Fig. 1c) is 

Q° = (2C + Cyo)z7 V5 (2) 

_ 2C(2C + Cpe 


AC+ Ct Cy Vin. 8) 


Note that if we set Cpi = Cp2 = C,, terms involving the parasitics 
cancel, i.e., 


Str Vl; 


* For N odd, (N + 1)/2 op-amps are required. 
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e e 
Vin V5 2 
r—1@ Vou 
Co1 + Cp2 ais 


(b) 
V 2C qd 
| 
| 1} 
Cor =t= Cp2=1t= 
rh 





(c) 


Fig. 1—Parasitic-compensated inverting integrator. (a) Complete circuit with para- 
sitic capacitances. (b) e-phase connectivity. (c) o-phase connectivity. 


when 
Cat = p2 >= C>. (4) 


In practice, equality of Cp = Cp2 is obtained by matching the geome- 
tries of the switches connected to nodes 1 and 2, and by matching the 
routing associated with nodes 1 and 2 using careful layout. In contrast, 
predistorting™ the capacitors to compensate for parasitics is an impre- 
cise operation because of the lack of tracking between unlike capaci- 
tors. It should be noted that in the proof given by eqs. (1) to (4), the 
parasitic capacitors were assumed to be linear. A more general proof 
establishing the validity of the matching conditions when the parasitics 
are nonlinear is given in Ref. 16. Throughout the remainder of this 
text, we shall assume parasitic matching when such an integrator is 
used. Under these circumstances, the input/output relations become 
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_ -[C/D]z-” 


out = ae ae ae Vin (5a) 
and 
—[C/D]z7 
out = — Ve (5b) 


Note that the classical inverting toggle-switched integrator shown in 
Fig. 2 has the same transfer functions. However, by virtue of the 
matched parasitics, the implementation in Fig. 1 is parasitic insensitive. 
We, therefore, refer to the input circuit of this integrator as a parasitic- 
compensated toggle (PCT). 

The noninverting integrator realization in Fig. 3 obtains its parasitic 
insensitivity via the same matching condition derived for the inverting 
integrator in Fig. 1. Derivation of the conditions for parasitic insensi- 
tivity follows identically that given in eqs. (1) through (4); hence, it 
will not be shown. Perhaps more interesting are the input/output 
relations, which are given below: 


C/D)z™' 
cut = ae Vin (6a) 
and 
% . C/O |e oF a: 
out = a Vin. (6b) 


Particular attention is directed to the extra one-half-clock-period delay 
produced by this circuit when compared to the standard noninverting 
structure.’ We shall see in later sections that this extra delay, hereto- 
fore available only by using additional clock phases, provides consid- 
erable flexibility for multiplexing op-amps in switched capacitor filters. 
In further references to the structure in Fig. 3, proper parasitic match- 
ing will be assumed. Since the input circuit to the integrator in Fig. 3 
provides inversion at the summing junction, it will be referred to as an 
inverting parasitic-compensated toggle (IPCT). 





Fig. 2—Classical inverting toggle-switched integrator. 
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Fig. 3—Parasitic-compensated noninverting integrator. 


lll. ALL POLE, PARASITIC-INSENSITIVE, SINGLE-AMPLIFIER SC 
BIQUAD 


To begin the derivation of the single-amplifier biquad, consider the 
conventional, parasitic-insensitive two-op-amp biquad'”” in Fig. 4a. 
The transfer functions” to each of the op-amp outputs (V1, Vos) are 


readily obtained from the z-domain equivalent circuit®’*”” in Fig. 5a. 
That is, 
3 —AGz" 
jee : (7a) 


~ Ve D(B+F)— (2DB + DF —AC)z + DBz~ 
and 
e —G(B + F — Bz7) 


a gee 
Vii D(B+ F) —- (2DB + DF — AC)z7' + DBz~ 


(7b) 


Let us now replace the sc elements G and C with corresponding PCTs 
according to the schematic in Fig. la. The resulting circuit is shown in 


L 


B 
~ p> 





(a) 


Fig. 4a—Switched-capacitor all-pole biquad—parasitic-insensitive realization. 
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Fig. 4b—Switched-capacitor all-pole biquad—parasitic-compensated equivalent to 
Fig. 4a. 
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(c) 
Fig. 4c—Switched-capacitor all-pole biquad—single-amplifier equivalent to Fig. 4b. 
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(b) 


Fig. 5—Switched-capacitor all-pole biquad equivalent circuits. (a) Equivalent circuit 
for the sc biquad in Fig. 4a. (b) Equivalent circuit for the sc biquads in Figs. 4b and 4c. 


Fig. 4b and its z-domain equivalent in Fig. 5b. The following observa- 
tions regarding the biquads in Figs. 4a and 4b can be made: 

(t) In both circuits, the integrating (storage) capacitors are up- 
dated during only one clock phase. During their respective hold phases, 
the op-amps are idle. 

(iz) In the circuit of Fig. 4a, storage capacitors D and B are updated 
during the same phase (in this case the e-phase). In contrast, for the 
circuit of Fig. 4b, storage capacitors D and B are updated on alternate 
phases. 

(zit) Referring to the equivalent circuits in Fig. 5a and 5b, we see 
that the loop gains for both circuits are identical. The only difference 
is the additional half-clock-period delay in the transfer function intro- 
duced for the first op-amp output. The delay between the filter input 
and the output of the second op-amp is the same in both cases. The 
transfer functions for the biquad in Fig. 4b are 
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Vion ~AGz™ es 
- D(B+F) — (QDB + DF—AC)z~ + DBz™ ie 


and 
a —z?°G(B + F — Bz™) 


Vi, D(B+F) — (QDB + DF—AC)z~ + DBz= 80) 


Reflecting on observation (1), it is seen that since each op-amp is idle 
during half the time, all the processing might be performed by one op- 
amp. Since, in addition, capacitors B and D are updated during 
alternate clock phases, according to observation (2), the two op-amps 
in Fig. 4b can be replaced with a single op-amp, multiplexed between 
the two storage capacitors D and B. The resulting single amplifier 
circuit is shown in Fig. 4c. 

Let us now make some observations regarding the single-amplifier 
biquad in Fig. 4c. 

(i) Biquads 4b and 4c share the same equivalent circuit (Fig. 5b). 
It will often be found that the equivalent circuit is a convenient 
mechanism for “untangling” single-amplifier biquads so that they may 
be analyzed in a straightforward fashion. Thus, the equivalent circuit 
provides a useful link between the single-amplifier biquad and its two- 
op-amp counterpart. 

(ii) It has been shown that to multiplex the op-amps within a two- 
amplifier biquad, the storage capacitors must be updated during dif- 
ferent clock phases. Hence, the use of either PcTs or parasitic-sensitive 
toggles'* seems unavoidable. 

(zit) The two transfer functions in eqs. (8a) and (8b) are available 
at the op-amp output on alternate phases. Sampling the output on the 
e-phase yields eq. (8a), while sampling on the o-phase yields eq. (8b). 

(tv) Damping is provided by a switched capacitor which has been 
called “F-type” damping.” Referring to the two op-amp biquad in Fig. 
4a, an alternative form of damping is obtained by placing an un- 
switched capacitor across switched capacitor C. This has been called” 
“E-type” damping. Unfortunately, E-type damping constrains the D 
and B charge updates to occur on the same phase. Consequently, E- 
type damping is not available in single-amplifier biquads. 

Now that the basic implementation has been derived, let us examine 
the integrity of the feedback loop as the switches open and close. Since 
the clocks are nonoverlapping, it is readily seen that there are periods 
of time during which the op-amp in Fig. 4c operates without feedback. 
Clearly, this is a potentially dangerous circuit condition. | 

An interesting but impractical way to maintain loop closure is to 
place an unswitched capacitor of value M across the op-amp as shown 
in Fig. 6a. Although the M capacitor solves the stability problem, an 
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(a) 


Fig. 6a—Single-amplifier biquad stabilization via continuous feedback—circuit sche- 
matic. 


analysis of this circuit reveals the impracticality of the solution. Since 
the results of the analysis are rather interesting we present them, in 
spite of our reluctance to recommend the M capacitive feedback. First, 
let us examine the equivalent circuit in Fig. 6b. Note that while in the 
actual sc circuit the M capacitor appears once, in the equivalent circuit 
it appears virtually everywhere. This is a consequence of the fact that 
M introduces coupling between B and D. Using the equivalent circuit 
in Fig. 6b, the transfer functions for the two-op-amp outputs can be 
easily derived. For this presentation we need only concern ourselves 
with output V%e, Le., 


02 —(A + M)Gz" 


Vi, {D(B+F)+M(B+ D+F) + M} — (2DB + DF 
—AC+M(B+D+A-—C)}z' + DBz” 


(9) 


Because of the coupling provided by M, the coefficient for the constant 
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(b) 


Fig. 6b—Single-amplifier biquad stabilization via continuous feedback—equivalent 
circuit. 


term of the denominator depends quadratically on M. In addition, M 
provides a leakage path for both B and D which can severely enhance 
the damping heretofore provided by F' alone. In order for M to have 
an insignificant effect on the circuit performance, DF > M(B + D), 
where B and D are typically large, while F' is near minimum value. 
Hence, if B and D are roughly equal, then M « F/2. Capacitor M is 
then too small to be practical. 

A practical solution which does not affect the biquad’s transfer 
characteristic is illustrated in Fig. 7. The sc network involving X and 
Y is seen to provide feedback around the op-amp when both clocks e 
and o are low, i.e., the dead zones of the nonoverlapping clocks. When 
either e or o is high, X, Y simply introduce capacitance to ground from 
the op-amp summing junction and output, respectively. The values of 
X and Y are relatively unimportant and can be set conveniently to one 
unit each. The crosstalk introduced by this arrangement is insignifi- 
cant. If the clock rise and fall times are reasonably fast, the oppor- 
tunities for crosstalk, 1e., interaction between X, Y and B, D, are 
minimal. Even if this is not the case, the interaction is self-compensat- 
ing. For example, if X, Y happen to pull out of the loop more slowly 
than D enters the loop, some fraction of the charge stored on D will 
leak onto X, Y. However, when the X, Y common junction is grounded, 
the charge which previously leaked off D is returned to it. This self- 
compensating property will be imperfect as a consequence of finite op- 
amp gain. However, for op-amp dc gains of 1000 or more, the interac- 
tion between X, Y and B, D will be negligible. Thus, the X,.Y sec 
network provides an elegant and practical solution to the op-amp 
stability problem. 
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Fig. 7—Single-amplifier biquad stabilization via XY sc feedback. 


IV. REALIZING SINGLE-AMPLIFIER BIQUADS WITH TRANSMISSION 


ZEROS 


The all-pole resonator in Fig. 7 can be easily generalized to realize 
transfer functions with transmission zeros by adding the feed forward 
circuitry shown in Fig. 8a. The z-domain equivalent circuit is given in 
Fig. 8b. The transfer function for each time slot can be readily derived 


from the equivalent circuit: 


—{D(K + I) — 2DK + DiI + J) — AG)z™" 


e 
o2 


+ D(K + J)z~*} 


P=. = DB+F)-QDB+DF-AQ)z"+DBz= 9 
and 
z M*{C(K +I)—G(B+F) 
T" aT — (C(K + J) — GB)z™"} (10b) 


~ Ve D(B+F)— (Q2DB + DF—AC)z + DBz~ 
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(a) 


Fig. 8a—Single-amplifier biquad which realizes the transfer functions in eq. (10)— 
circuit schematic. 


The design of this single-amplifier biquad follows along previously 
published design procedures” for two-op-amp biquads. Note the man- 
ner in which the delay property of the 1pcT has been exploited. 
Comparing the transfer functions T and T’, we observe that only T 
has a quadratic numerator. Hence, by focusing our attention on T, 
little generality and flexibility is lost. As in Ref. 12, we can simplify T 
by arbitrarily setting A = B = D = 1. At the end of the design process, 
one can adjust the values of A, B, and D to set the gain level of T’ and 
to independently scale the capacitances, which share common time 
slots. Setting A = B = D = 1 in eq. (10a) yields 

T= 02 —{((K+1)-(K+1I+J—G)z'4+(K+d)z7} 


= 1 
Vin (l+ F)-(2+F-C)z¢4+2z” me 
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Fig. 8b>—Single-amplifier biquad which realizes the transfer functions in eq. (10)— 
equivalent circuit. 


If the desired transfer function 1s given by 
m{y — «z7' + 6277} 


G ig elec 
1 — az + Bz” 


(12) 
the unscaled values for capacitors C, FP, G, I, J, and K are obtained by 
matching the like coefficients of eqs. (11) and (12). For the poles, the 
design equations are 


pat (13a) 
and 
1+f-a 
C = ——__——__.. 13b 
B (13b) 


To place the transmission zeros, the general design equations and a 
simple solution, are given in Table I for each generic filter type.” The 
generic filter types are listed in Table II for convenient reference. Note 
that the actual capacitor values G, I, J, and K are related to the scaled 
values G, I, J, and K by the relation 


K=(1+ F)x, (14) 
where 
x=G,I,d,K. 


Examining the numerator of T in eq. (11), one observes that real zeros 
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Table I—Zero placement formulas for T 
Filter Type Design Equations Simple Solution 
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on alternate sides of z = 0 are nonrealizable. As a result, the bilinear 
bandpass BP10 function is not realizable with this circuit. This is not 
a severe restriction since the other bandpass types are realizable. 

As noted previously, the synthesis equations result in unscaled 
capacitor values. To obtain properly scaled capacitors, one must first 
scale the level of Vo; to a suitable level. By adjusting A and D, the 
level of V2, (i.e., JT’) can be scaled without affecting V%2 (i.e., T’). More 
precisely, if the gain constant of T’ is to be scaled to 
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Table IIl—Generic biquad transfer functions 


Generic Form Numerator N(z) 
LP20 (bilinear transform) m (1 + 27")? 
LP11 mz (1+ 27!) 
LP10 m (1+ 27?) 
LP02 mz? 
LPO1 mz”! 
LP0OO m 
BP10 (bilinear transform) m(1—27')(1 + 27) 
BPO1 mz ' (1-27!) 
BP00 m (1 — 27’) 
HP m(1—2z')? 
LPN m (1— ez! +27), e>a/vB, B>0 
HPN m (1 — ez7! + 277), e<a/VB, B>0 
AP m (B ~ az + 27”) 
General m (y — ez”! + 027”) 

T’ > pT’, (15) 


then it is only necessary to scale 


1 1 
(A, D) > Fa Dj. (16) 
wo 





The gain constant associated with JT will remain constant under this 
scaling. 

Once satisfactory gain levels have been achieved, one can scale all 
capacitors associated with each time slot so that the minimum capac- 
itance corresponds to a single unit. The two groups of capacitors which 
are to be scaled together are 

Group 1: (C, D, G) 

Group 2: (A, B, F, I,J, K). 
Note that the capacitors in each group are distinguished by the fact 
that they are connected to the input node of the op-amp during the 
same clock phase. The groupings are easily identified in the equivalent 
circuit where the groupings are determined by the amplifier input they 
are incident on. 

To cascade single-amplifier biquads, it is important that the input 
be accepted by all feed-ins on the same clock phase. The biquad in 
Fig. 8 possesses this property. 

Note that the inputs to the single-amplifier resonator can be chosen 
differently from those selected in Fig. 8. An alternate realization is 
shown in Fig. 9. The transfer functions for this circuit are 


—-{(B+ F)(G+L) —[2BL + B(G+ A) 
a + F(H +L) + CIjz7' + B(H + L)z*} 


—s Vi, D(B+F) — 2DB + DF — AC)z + DBz™ ae) 


and 
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(a) 


Fig. 9a—Single-amplifier biquad which realizes the transfer functions in eq. (17)— 
circuit schematic. 


Von, me P{A(G + L) + DI- (A(H + L) + DIz™} 
VE, = D(B+F)-(2DB + DF-— AC)z! + DBz~ 


This circuit is not as general as the circuit in Fig. 8, hence, it will not 
be pursued further. 

It is well known” that, by sharing switches operating in synchronism 
between common nodes, the switch count for a given sc filter can be 
significantly reduced. Circuits involving PCTs can also share switches. 
Switch sharing for pcT branches is demonstrated in Fig. 10. Note that 
sharing switches and combining the toggling capacitors 2G and 2C, as 
shown in Fig. 10b, retains the parasitic cancellation property. In this 
operation, three switches are saved. Sharing of switches within the 


, 


(17b) 
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(b) 


Fig. 9b—Single-amplifier biquad which realizes the transfer functions in eq. (17)— 
equivalent circuit. 


i ee a > —< 
(a) 


Fig. 10a—Switch sharing among parasitic-compensated toggles—original circuit. 


biquads in Figs. 8 and 9 yield the switch-reduced circuits in Figs. lla 
and 11b, respectively. Also note that one of the switches associated 
with a storage capacitor can often be eliminated by judicious sharing 
with one of the single-pole-double-throw (SPDT) switches associated 
with a switched capacitor. For example in Fig. lla, the e switch on the 
top plate of B is shared with the e half of the SPDT switch on the top 
plates of F' and A. The fact that the top plate of B is switched to 
ground during o does not adversely affect circuit operation. However, 
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Fig. 10b—Switch sharing among parasitic-compensated toggles—switch-reduced 
equivalent of Fig. 10a. 
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Fig. 1la—Switch sharing in single-amplifier biquads—switched shared equivalent to 
the circuit in Fig. 8a. 
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Fig. 11b—Switch sharing in single-amplifier biquads—switched shared equivalent to 
the circuit in Fig. 9a. 


when sharing storage-capacitor switches, one must be careful to avoid 
inadvertently including storage capacitors in a closed loop of capacitors 
during their holding phases. 


V. LEAPFROG STRUCTURES 


The circuit techniques illustrated in the previous sections are readily 
applied to reduce op-amp count in leapfrog structures. The resulting 
structures are parasitic insensitive. As before, closed loop stability 
during clock dead zones can be ensured by using the feedback network 
of Fig. 7 for every op-amp. It is readily shown that Nth order leapfrog 
structures can be implemented with N/2 op-amps for N even and 
(N + 1)/2 op-amps for N odd. 

It is interesting to note that lossless discrete integrator (LD1) de- 
rived’ leapfrog structures are inherently multiplexable. An imple- 
mentation of a four op-amp, fourth-order low-pass leapfrog structure 
is shown in Fig. 12a. Note that all parasitic-sensitive toggle-switched 
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Fig. 12b—Fourth-order low-pass sc leapfrog filter—two-amplifier realization. 
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capacitors have been replaced by ects. Hence, the implementation in 
Fig. 12a is parasitic insensitive. In the implementation shown, op-amps 
1 and 3 update charge during the odd phase, while op-amps 2 and 4 
update charge on the even phase. To derive the two-op-amp imple- 
mentation, one simply multiplexes alternate time-slotted op-amps in 
a pairwise fashion. For the implementation in Fig. 12b, op-amps 1, 2 
and op-amps 3, 4 are pairwise multiplexed. One could alternatively 
pairwise multiplex op-amps 1, 4 and 2, 3. However, it appears that the 
interconnecting network is simpler if adjacent op-amps are multiplexed 
as is the case in Fig. 12b. To complete the structure, X, Y sc feedback 
circuits should be added to each op-amp. Switch sharing can also be 
used to reduce the switch count, but this will not be shown here. 


VI. CONCLUSION 


Circuit techniques which reduce by a factor of two the number of 
op-amps required to implement a given Sc filter, while maintaining all 
parasitic insensitivity, have been given. Both biquad and leapfrog 
structures have been considered. Although only all-pole leapfrog struc- 
tures were discussed in this paper, the techniques presented are readily 
applicable to elliptic-type leapfrog structures as well. 

Breadboards have been constructed to establish feasibility and to 
examine the severity of the stability problem. Our experience has been 
that some sort of feedback is required around the op-amps at all times 
and that the X, Y sc feedback circuit works as expected. However, to 
determine whether such op-amp multiplexed circuits are really prac- 
tical, sample designs will have to be realized in integrated form and 
characterized. In particular, the effects of op-amp slew rate and settling 
characteristics, as well as the noise and power supply rejection prop- 
erties of these circuits, need to be investigated further. 
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Moment Formulae for a Class of Mixed 
Multi-Job-Type Queueing Networks 


By H. HEFFES 
(Manuscript received August 26, 1981) 


Queueing network models play an important role during each stage 
of a computer system’s life cycle (from initial conception to system 
maturity), where in each stage broadly applicable performance anal- 
ysis tools are needed. This paper presents new results which contrib- 
ute to the foundations of a tool to support performance analysis and 
modeling activities. In dealing with some performance tissues, it is 
important to be able to quantify distribution or moment information, 
because these quantities can influence system capacity and service 
and performance measures. It is also important that models include 
the effect of congestion adaptive 1/0 devices, in a stable and efficient 
manner, for this inclusion can significantly affect the outcome of 
studying certain performance issues. We address the problem of 
direct, recursive computation of moments of the queue size distribu- 
tions at a class of service centers embedded in a mixed network of 
queues. The parameterized class includes state-dependent processing 
rates useful in modeling congestion adaptive 1/0 devices. We also 
present results for calculating moments of both the waiting time and 
virtual delay (work backlog) distributions at a class of service centers. 
In addition, we obtain a Little’s Law type of relation between delay 
moments and queue size factorial moments. For a class of networks, 
an algorithm is given for the direct, recursive computation of the tail 
of the node delay distribution. 


l. INTRODUCTION 


Performance analysis and modeling activities are essential for an- 
swering key questions at various stages of a computer system’s life- 
cycle, ranging from initial conception to maintaining and growing a 
mature system. Although both the questions asked and the analysis 
approaches may differ from stage to stage, in each of these stages 
broadly applicable performance analysis tools are needed to support 
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such activities. This paper presents new results which contribute to 
the foundation of one such tool: algorithmic techniques for efficiently 
solving a class of queueing networks. 

In dealing with some performance issues, it is important to be able 
to quantify distribution or moment information (e.g., delay variability 
as opposed to only the mean delay)’ because these quantities can 
influence system capacity and service and performance measures. It is 
also important that models include the effect of congestion adaptive 
1/o devices, in a stable and efficient manner, for this inclusion can 
significantly affect the outcome of studying certain performance issues 
(e.g., the impact of multiprogramming).”” 

In this paper, we present results for the direct recursive computation 
of moments of the queue size distributions at a class of service centers 
embedded in a mixed network of queues. The class of service centers 
allows us to efficiently treat, in a stable manner, a parameterized class 
of state-dependent processing rates useful in modeling congestion 
adaptive 1/o devices.**” By dealing with mixed systems, we allow 
consideration of systems with workloads from a finite population (e.g., 
a collection of terminals), multiprogrammed systems, together with 
_ workloads from basically infinite customer populations. We present 
results for calculating moments of the waiting time and virtual delay 
distributions at a class of service centers that enable us to quantify the 
variability of node delays, as well as work backlogs. 

The well-known class of multiple resource models, usually referred 
to as product form queueing networks,°® have been used to address a 
wide range of performance issues, such as capacity estimation and 
planning, bottleneck identification, performance prediction,’ memory 
interference, and software lockout in multiprocessor systems.® Usually 
the models used to address these issues have approximately included 
the factors of interest (e.g., priority processor scheduling disciplines).° 
A considerable amount of effort has been devoted to the study of this 
class of queueing networks, and efficient computational algorithms 
exist’*"° that allow one to obtain, for example, mean values of the 
desired quantities. While, in principle, the entire network queue size 
state description can be obtained from the above, one may be inter- 
ested in obtaining results, directly, for a more moderate level of detail, 
e.g., moments of queue sizes, as well as in quantifying variability of 
delays at a network node. Existing algorithms for calculating even only 
mean values can become much more complex and sometimes exhibit 
chaotic behavior."’ This situation arises, for example, when a state- 
dependent service rate is used to model a class of devices, such as an 
efficiently scheduled disk or drum,**”* whose efficiency is a function 


*See Ref. 5 for examples of secondary storage units that employ a scheduling 
algorithm which attempts to minimize rotational latency and/or seek times. 
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of the number of queued requests. The complexity results from a 
requirement that the entire* marginal queue size distribution is needed 
at each step of the computation.’ 

The results in this paper overcome some deficiencies in present 
methods for analyzing networks of queues and enable us to quantify 
important performance measures for which, previously, no efficient 
computational methods existed. In Section 1.1, we discuss how a tool 
for evaluating networks of queues can fit into a performance analysis 
and modeling methodology, and in Section 1.2, we give a more specific 
definition of the problems treated and outline the remaining sections 
of this paper. 


1.1 Use of network of queues models 


To illustrate the aforementioned need for broadly applicable tools 
and how a queueing network analysis tool can fit into a performance 
analysis and modeling methodology, we consider the various stages of 
a system’s life cycle. During the system’s conception stage, where one 
is concerned with the services to be offered, broad objectives—per- 
formance, reliability, etc.—basic architecture, proposed components 
and initial sizing, questions involving initial feasibility arise. Initial 
feasibility studies generally require several tools to address the follow- 
ing type of question: Given a proposed architecture and assumptions 
concerning the way the system is to be used (e.g., obtained from a 
gross workload characterization tool which may indicate various usage 
scenarios), how well can the system be expected to perform? The 
answer to this question often leads to a modification of the proposed 
architecture and/or what the system is planned to do. During this 
stage, where system specification is often at a macroscopic level of 
detail, a tool based on a network-of-queues methodology can be useful 
for the performance prediction. Here, for example, use of a tool 
incorporating a central-server’* queueing network model can be helpful 
in answering questions such as the effects of cpU size, number of disks, 
and the number of terminals that can be supported, while meeting 
broad performance objectives, as well as estimates of cost. The partic- 
ular form of input that may be available from a workload characteri- 
zation is particularly suitable for use by a network-of-queues method- 
ology at this stage. 

During a system design phase (when one is attempting design 
optimization), one may tend to focus more on subsystems, and models 
would tend to include more microscopic details, such as a disk schedule 
model or a CPU process schedule model. Although this stage generally 
requires more detail than is attainable with a network-of-queues model, 








* Some simplifications result when the state dependence disappears above a given 
loading,””° the so-called limited queue-dependent server. 
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issues such as the effect of the degree of multiprogramming*’ and the 
effect of a disk schedule which uses a SCAN! algorithm versus a FIFO 
discipline, can be treated. We note that at this stage it is important to 
have models which include, at least parametrically, the effect of a 
device being congestion adaptive since this can significantly influence 
the performance that can be expected.’ In this stage, distributional 
information can also be important, as opposed to just dealing with 
mean values, since performance criteria and service objectives may be 
in terms of the tail of a distribution. 

During the stage when one is dealing with and maintaining a mature 
system—adding features, growing, doing capacity estimation and plan- 
ning—a tool or methodology for viewing the overall system is desirable. 
Here, measurement tools, providing resource utilizations, workload 
characterizations, and response times, together with proposed changes 
in system use and growth, can be used with an overall system perform- 
ance model to answer such questions as Is the system adequate? How 
much more load can it handle? or, What system changes are required 
to handle a further load increase? A capacity planning methodology 
which accepts, say, maximum allowable utilizations on various system 
resources and measurements of current utilizations and workloads, and 
predicts allowable increases in load could use the output of a network- 
of-queues tool to determine what the maximum resource utilizations 
should be. We note that the determination of these levels, and thus 
the system capacity, could very well be based on distribution infor- 
mation. 


1.2 Outline of paper and summary 


In this paper we address the problem of recursively computing 
moments of the queue size distribution at a service center embedded 
in a class.of product form networks. Parameterizing the service center 
with respect to its state dependence allows us to treat, in an efficient 
manner, congestion adaptive models with either improved or degraded? 
efficiency. We treat mixed systems where the population corresponding 
to a given job type may be fixed (1.e., a closed chain) and where system 
requests corresponding to other job types may arrive exogenously from 
basically an infinite population (i.e., an open chain). Closed chains 


* Reference 2 also shows that the multiprogramming effect, representing an increase 
z potential throughput of up to 75 percent, can depend strongly on the disk scheduling 

gorithm. 

T Actually Ref. 2 considers the LOOK algorithm which is similar to SCAN. 

* Reference 3, which presents a performance comparison of two 1/0 access disciplines, 
notes that the relative comparisons, representing capacity improvements in excess of 
100 percent, can depend strongly on whether the 1/o device is congestion adaptive (e.g., 
a fixed head drum employing the shortest access time first schedule) or congestion 
independent (e.g., FIFO). 

§ Resulting, for example, from increased overhead with increased processor queueing. 
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arise when dealing with finite population models where, for example, 
a finite number of terminals places requests into a computer system. 
They also arise in modeling multiprogrammed systems where the size 
of the finite population corresponds to the degree of multiprogram- 
ming. 

Figure 1 shows an example of a mixed system where the closed 
chains correspond to each of the terminal groups, each group with 
possibly different think-time distributions and routing and service 
requirements (workloads). Figure 2 shows a closed network model 
representing requests arriving to a system, S, over a finite collection of 
access trunk groups. A request is blocked if all trunks in the group are 
occupied. Note that a trunk is held during the entire time the request 
is in S. When a trunk is available in group 1, requests enter S from 
group 1 at rate 4; (the Poisson arrival rate to group 7). The population 
of chain 2 is K;, the size of trunk group 1, and the blocking probability 
at group 1 corresponds to 1—utilization of the node with service rate 
\;. In addition to throughputs and resource utilizations as the usual 
quantities of interest, other quantities of interest may be queue size 
moments, and the distribution, or moments, of the time from trunk 
activation to its first entry into the cPuU (reaction time). 

We treat the problem of recursively calculating moments of the 
waiting time and virtual delay distributions at first-come-first-served 
(FCFS) state-independent service centers, which enables us to get a 
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Fig. 1—Mixed system illustration. 
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Fig. 2—Closed system illustration. 


handle on the variability of nodal delays and to estimate the distribu- 
tion of such delays. With knowledge of the virtual delays, we can 
compare performance, for example, as viewed by arriving customers to 
that viewed by an outside observer. We can thus estimate performance 
measures such as the system reaction time’ distribution, which is a 
measure of the time between a terminal request and this request first 
getting the attention of the cpu. In another application, it could 
correspond to the time between a request coming into a system over 
an access trunk group and the start of processing. 

In addition to obtaining algorithmic results, we obtain a relation 
between moments of node delays and queue sizes at a class of FCFS 
service centers. We finish with an algorithm for the recursive compu- 
tation of the tail of the nodal delay distribution at a FcFS service center 
embedded in a network consisting of either single-server, state-inde- 
pendent nodes or infinite server nodes, e.g., the central server model 
of multiprogramming.” 

The organization of this paper is as follows: In Section II, we present 
the class of networks under consideration, including specification of 
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the job-type characteristics and specification of the class of service 
centers. In Section III, we consider closed systems and present results 
for the factorial moments of queue size distributions, for different 
levels of customer aggregations. The aggregations considered are the 
total number of jobs of a given type at a network node and total 
number of jobs of all types at a network node, the former requiring 
consideration of joint factorial moments and correlations. Results for 
mixed systems are given in Section IV by considering a mixed system 
as the limiting case of a multi-job-type closed system, formed by 
augmentation, as the population increases and the augmented node 
becomes the bottleneck. 

The recursions for nodal delay and flow time moments are presented 
in Section V for closed systems and in Section VI for mixed systems. 
Delay results are obtained as experienced by arriving customers or as 
experienced by an outside observer 1.e., the virtual delay. Delay distri- 
bution results appear in Section VII, and the appendices contain 
details of the investigations. 


ll. CLASS OF NETWORKS: SERVICE CENTER AND 
JOB CHARACTERISTICS 


We define the structure of the networks in terms of the types of 
service centers or nodes and the job-type characteristics in terms of 
their routing through the network, their service requirements per 
service center visit,* their populations for closed job types, and their 
exogenous arrival rates for open job types. 

We consider a network with R types of customers (jobs, chains) 
where s = 1, 2, --- , r correspond to closed chains and s = r + 1, 

, R correspond to open chains. The nodes in the network are either 
single-server nodes, using a FCFS,' processor sharing or last-come-first- 
served preemptive resume (LCFS-PR) queueing discipline or infinite 
server nodes. At the single-server nodes, the processing rate can 
depend on the total number of customers present. 

Customers of a given job type (chain) may change class membership 
as they traverse the network but always remain in the same chain. 
Allowing customer class change allows one to model a broad class of 
routing scenarios including a deterministic, fixed sequence of node 
visits and also allows for different visits to, for example, a processor 


* Job characteristics can also be defined in terms of the workload requirements, the 
average resource usage/job lifetime for each resource where job lifetime is suitably 
defined for closed customer types. 

' Other types of customer selection rules, not depending on actual customer service 
pouuements (e.g., random selection, LCFS nonpreemptive) can also result in product 

orm 
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sharing node by a given customer to have different service requirement 
distributions. 

For the open chains, s = r + 1, ---, R, we let Aiocs = rate of 
exogenous arrivals of chain s customers, with class membership c, to 
node i. Exogenous arrivals of customers, of a given chain-class pair, to 
a particular node are assumed to be Poisson.” 

If we let piic = the probability that a chain s job completing service 
at node & as a class 7 customer will next enter node i with class 
membership c, then the actual node-class-chain flow rates Ai... satisfy 
the so-called traffic equation, 


Nowe Be Noes 2 i ») Ar, sDeLic} 
(RDETs 


s=rt+1,---,R; (tz, c) E Is, (1) 


where I; is the set of feasible node-class pairs for chains s jobs. 
We note that for each closed chain, the node-class-chain rates satisfy 


Ness = y Ar,sDELic3 


(k,DETs 
S=sl,+-+:,r} (tz, c) € Ig. (2) 
The closed chains are further specified by their population, 
K = (Ki, Ko, --- , K;), (3) 


where K, denotes the system population corresponding to the closed 
chain s. We denote the rate of customers of a given chain, s, flowing 
into a node as 


Vis S > Nive (4) 


cEC;(s) 
where 
Ci(s) = [c; (i, c) € Is]. 


We assume that for each open chain the traffic equation (1) has a 
unique solution! for the traffic rates 


(Aics; (t, c) © Is); s=rt+1,---,R, 


and that every closed chain has an irreducible routing matrix so that 


* This does not preclude having a large class of state-dependent arrival processes, 
including examples where customer sources are turned off or blocked when their system 
population reaches a threshold. These can easily be transformed into a mixed network 
of the type being considered. 

*This precludes certain pathological cases where customers enter the system and 
never exit. 
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the solution to (2) is unique up to a scalar.* Thus, the actual flow rates, 
for a given job type, can be specified by an arbitrary solution to (2), 
yielding relative arrival rates for classes in a closed chain, and a 
proportionality constant. 

We note that the actual proportionality constant, for a given closed 
chain, say A,(K), depends on the population vector K and that the 
actual flow rates, Aics(K), can be written as 


Nies(K) =Xs(K)Aics, (i, c) E I. (5) 
Thus, the actual node-chain flow rates A;;(K) can be written as 
is(K) = \s(K)Ais, (6) 


where Aj; is given by (4). 
The assumptions on the service time distributions are those for 
which the product-form solution holds (see Ref 6). We denote 


Us = Mean amount of service required by a chain s class c customer 
at node z when only one customer is present at node z (note that 
at a FCFS node it 1s required that pics = ier for all c, s, e and f), 


and 


uj(k) = the processing or service rate of node i when a total of k 
customers (regardless of type) are present.’ 


Clearly, the average service requirement of an arbitrary, with respect 
to class, chain s customer, denoted yj,', is given by the weighted 
average of the individual class average service requirements. 


Nies 


me (7)* 
cEC;i(s) Nis iu 


Hiss = 
While we allow the network to contain nodes with general state- 


dependent processing rates, we focus on getting queue size moments 
at nodes with a parameterized class of processing rates defined by 


* This follows from the assumption of irreducibility of the routing matrix 


P= [ Dizicl; kl, ice 1a 


and precludes the pathological case where the set of communicating node-class pairs 
can be decomposed into disjoint subsets. We note that the matrix can be made 
irreducible by using enough chains. 
tFor example, the number of seconds a processing per second of elapsed time. 

Without loss of generally we have let (1) = 

* Recall that these are not adjusted for CG isceeaias processing rates. Another 
interpretation of (7) is that pi! represents the average service time of an arrival to node 
i in a system whose only customer is a single chain s customer. 
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Fig. 3—A class of state-dependent service rates. 


k 


i(k) =———_—___, 
pal ) Aio + ank 


(8)* 
where @;,; = 0; dio + Gj, = 1. We note that a;; = 1 corresponds to a 
single server with state-independent processing rate, while a; = 0 
corresponds to an infinite-server node, often useful in modeling a finite 
population source—a collection of terminals—or a random delay. 
These are shown in Fig. 3, along with cases 0 < ai, < 1 and a; > 1 that 
can be used to represent congestion adaptive devices. We also note 
that the family of service rates may also be useful in approximating 
subsystems with restricted entry’””’ by a single state-dependent node 
as is done in multiprogramming models.’’ What is needed here is the 
subsystem throughput as a function of the sizes of its populations, and 
recursive methods are naturally suitable for obtaining this. 


ll. QUEUE SIZE FACTORIAL MOMENTS—CLOSED SYSTEMS (R= r) 


We present recursions for two different levels of customer aggrega- 
tion: (z) for the total number of jobs of all types at a network node, and 
(zz) for the total number of jobs of a given type, the latter requiring 
consideration of joint factorial moments and correlations. 

We denote 


* The methods of this paper can be extended to include a generalization of this family 
of processing rates. The above family provides a good fit to the empirically obtained (by 
R. J. T. Morris) state dependence used in Ref. 2 for an efficiently scheduled (similar to 
SCAN) moving head disk over the multiprogramming range considered. 
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Biy(K) = EL Rikki — 1) --- (Ri -~7 + 1); K] (9) 


as the jth factorial moment of the total number of customers, k;, at 
node i at an arbitrary time in equilibrium, for a system with population 
vector K. Appendix A shows that £,(K) satisfies the following recur- 
sions 

. Nis(K) 
Bi(K) = x a {ain Biy(K — es) 


+ [1+ (J — DaalBiji(K —e,)},  (10)* 


with initialization £,(0) = 0; 7 > 0 and £;,.(K) = 1. The quantity e, is 
a unit vector in direction s. We thus obtain the jth moment at popula- 
tion K by updating the jth moments corresponding to systems with 
one less customer for each chain in addition to similarly including the 
effect of the (7 — 1)st moments. The node-chain throughputs, 
Ais(K), are available via standard mean-value analysis; however, the 
standard analysis requires computations!’ of marginal probabilities at 
nodes with state-dependent processing rates. For networks, with nodes 
of the type under consideration, we can generalize mean-value analysis, 
in a stable manner, without ever computing marginal probabilities and, 
furthermore, obtain the higher order moments. When / = 1, (10) yields 
the generalization 
R 
pak) = 5 


s=1 is 





 TanBa(K — e,) + 1]. (11) 


Defining the mean node flow time for a chain s customer as the mean 
time a chain s customer spends at node i per visit* to the node i queue, 
T;;(K), Appendix A shows that 


T,,(K) = — ipa Ss) (12) 


The mean number of chain s customers at node i 
Bi1,s(K) = EB (Ris; K) 
satisfies 
Bas(K) = Ais(K)T;.(K), (13)8 
where the (i, s) throughput satisfies [see (6) | 


* In all recursions, terms with a negative population component are zero, e.g., K — es; 
with K = 0. 

* Those computations can become unstable. 

*If a customer is fed back to the same queue after a service completion, as in the 
central server model of multiprogramming, this corresponds to starting a new flow time. 

§ The relation with (12) is via Little’s Law for chain s customers at node i. 


11,18 
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Initialize Bi (0) =07>0, B;,(0) = 1. 

Loop on K until desired population K*. 

Compute node flow time means, 7;,(K), from (12), /eN(s/, 5 = 1,2,...,R. 
Compute throughput proportionality constants, X g(K) from (15),s = 1,2,... 
Compute node-chain throughputs, d, (K), ie NM(s), $= 1,2,...,R, from (14). 


Compute B;, .(K) from (13), /eM(s),s=1,2,..., R and B;,(K) from (16). 


If do not desire Bi (K) j>10or Byig(K) j > 1,go to 2. 


If desire B;,(K) J >1,compute B;(K) from (10) at desired nodes for 1</<JVJ. 
If do not want Bis(K) j >1,go to 2. 


If desire Bj yg(K*): lf K=K*—Je, initialize Y, 0,2, g(K) = Byg(K), 2=0,1,... 


If K = K*—ne, n < J,compute Yi y—nostK) from (21) as shown in Fig. 5. Go to 2. 





Fig. 4—Algorithm for queue size moments—closed systems. 


Ais(K) = AisAs(K). (14) 
In terms of the flow times, the chain s proportionality constant is 
\;(K) ee. eee: s=1,2,---,R, (15) 
Y Nis T'is(KX) 
iE N(s) 


where N(s) is the set of nodes visited by chain s customers. To close 
the algorithmic loop, we use 


Ba Ba(K) = 5 Bias(K). (16) 


The algorithm (12) — (15) — (14) — (18) — (16) (see Fig. 4), with 
initial condition B;,(0) = 0 and with the relative traffic rates, A;;, as in 
the previous section,* represents a simple, stable’ modification of 
mean value analysis for the desired class of state dependencies. With 
the node-chain throughputs from (14), the aggregate higher order 
factorial moments are obtained, in a numerically stable manner, by 
(10). We note that, unlike the mean values, it is only necessary to 
compute higher order moments at those nodes of interest. Thus, at 
the desired node we have the algorithm (see Fig. 4) (12) — (15) > 
(14) — (13) — (16) — (10). We now turn our attention to obtaining the 
higher order moments for the lower level of aggregation. 

To obtain the higher order (>1) factorial moments of the number of 
each type of customer at a node 


* We recall the relative traffic levels need not be computed for each population vector 
since they do not depend on K. 
* Note that no subtractions appear in the computations and recall aj => 0. 
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Bijs(K) = El Ris(Ris — 1) +++ (Ris — 7 + 1); KY], (17) 
we consider the joint factorial moments 
yijis(K) = E[Ris(Ris — 1) +++ (Ris — 7 + 1) 
-ki(ki — 1) +--+ (Ri ~1 +1); K]. (18) 
Note that 
Yio1s(K) = Bi2(K), (19) 


the high level /th factorial moment defined by (9) and computable 
from (10), and 


¥i,j,0,sK) = Bi7,s(K) (20) 
the desired low level jth factorial moment. 
From Appendix A, we have the recursion 
Nis(K) 


+ (1 + 2lain)yij-1,1,s(K — es) + [1 + (1 + aa) (Z — 1) 
+ ain(Z — 1)7]yi;-10-1,s(K — es) }; J>0. (21) 


For j = 0 the computation is made via (10) and the identification (19). 
We note that for 7 = 0, 7 = 1 (21)' reduces to (13). Making the 
identification (20), gives 


Ais(K 
Bijs(K) = —_ [ Bij-1,s(K . e;) + Qinyij—-1,1,sK — e,) |. (22) 


Yi, ius(K) = 





{ @inVij-1,141,s(K — e,) 


We note that if we are interested in the Jth factorial moment of the 
number of chain s customers at node 7 for a target population K%, 1.e., 
Biss(K*), then (21) and (22) are initialized at population K* — Je, with 


Yio,,s(K™ — Jes) = Bi(K* — Jes), 
£=0,1,---,d, | (23) 


the simple high-level aggregation result. We further note that (21) and 
(22) only need be updated along parameter direction e,. This is shown 
schematically in Fig. 5 and the steps in the algorithm in Fig. 4. If, for 
example, one is interested in a mean and variance analysis, this 
specializes to 

Ais(K*) 


. 


is 





Bizs(K*) = [Bis(K* — es) + airyiris(K* —es)], (24) 


* Terms with a negative subscript are zero. 
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Fig. 5—Schematic for moments of low-level aggregation—closed systems. 


where the correlation is given by 


Ais(K* ~~ 


@s 
ile Se ) TanBo(K* — 2e,) 


is 


+ (1 + 2an)Ba(K* — 2e,) +1]. (25) 


IV. QUEUE SIZE FACTORIAL MOMENTS—MIXED SYSTEMS 


To obtain the mixed network results, we consider an augmented 
closed system which contains an external node for each open chain. 
The service rate of an external node corresponds to the system arrival 
rate for the corresponding open chain, i.e., Aos = Viens, Atos} $= r+ 
1, ---,#. When customers depart in the original system, they are 
routed to the appropriate external node in the augmented system. A 
departure from a given external node, say, corresponding to the open 
chain s, is routed to node 7 as a class c customer with probability A.ocs/ 
Aos- Taking the limit of the closed network results as the populations 
of chainsr + 1, --- , R — o, and assuming the external nodes become 
the bottlenecks,’ we obtain the desired recursion for the high-level 
aggregate factorial moments 


‘Note that if an internal node becomes a bottleneck, the original mixed system is 
unstable. 
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0 
By(K) — [1+(/-])a;] T_. an Bi j-1(K) 


LoS {an Bi(K — es) 
+ [1+ (j— l)an]B;;.(K —es)}, (26) 
where 
Ns 


s=r+1 is 


pi = (27) 
is the unadjusted utilization [unadjusted for state-dependent service 
rates pi(k) # 1] at node 7, corresponding to all open chains. We note 
that the stability condition is given by 


pid <1, (28) 
where we note that p;aj: is the limiting (as k; > ©) utilization due to 
customers belonging to open chains. Equation (26) is initialized by 

Bio(K) =] 
and 


1+ (7 — Iaalp? 
Bi(0) = et al Bi;-1(0), (29) 
— Qiipi 
the open network factorial moments. For 7 = 1, (26) yields 
: Ais(K) 
: aia [1 + aaPia(K —es)], (30) 


n(K) =—————__ 
pa (l— ap?) = Bis(1 — anpi 


with initial condition 


Bir(0) Se = (31) 
1— Qiipi 


The required node-chain throughputs, A;;(K), in (26) can be obtained 
via a standard type of mean analysis. However, as before, we are 
interested in a generalized analysis that does not involve marginal 
probabilities. These throughputs are obtained via the limiting argu- 
ment which yields the recursions 


is(K 
Bas(K) = we [1 + anBu(K — es) ]; sSr (32) 
for the closed chains and 


Bias(K) = ~ [1 + aiBa(K) J; s>r, (33) 


for the open chains. We note that in (32), Ais(K), the closed chain s- 
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node z throughput, is to be solved for, whereas in (33), Ais is simply the 
solution to the traffic equation for open chain s. Summing (33) over all 
chains, we obtain 


o(K) = Ph RE 
4K) = agg ll + ani), (34) 
where 
R 
BRK) = 2 Bis(K), (35) 
Ba(K) ae x Bias(K), (36) 


which relates the mean value of the total number of open customers 
at node z to the mean value of the total number of closed customers at 
node i. To obtain the desired recursion for Bis, s S r we use (34) in 
(32), which results in 


Nis(K) 
i1s(K) =—————_——_ [1 + an Bi(K — e.)], <7; 
Bi1,s(K) wis(1 — aap?) [ ainPir( es) | s=Pr (37) 
and from Little’s Law, the mean chain s flow times 
1 
Tis(K) = —_———_— [l 2 anBba(K = es) |, s=r. (38) 
Lis(1 — ainp?) 


The node chain throughputs are calculated from 


where, \,(K), the chain s proportionality constant (independent of 2) 
is given by 
K; 


MK) = STK) 


S=r. (40) 


The algorithm, 
(38) — (40) — (39) — (37) — (86), 
(see Fig. 6) with initial condition 
(0) = 0, 
is identical to the algorithm (see Fig. 4) for closed systems, 
(12) — (15) — (14) > (18) — (16). 


However, the closed chain service rates have been adjusted to account 
for the presence of the open chain customers, 1.e., 


[is = pis(1 — ainp?). (41) 
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Initialize B;,(O) from (31) and B;;(O) from (29) for 1< / S desired order moment. 
rr (O) = 0. 


Loop on K until desired population. 


Compute node flow time means, T;,(K), s$r 
(i.e., for the closed chains) from (38) for /eN/(s). 


Compute throughput proportionality constants Xr s(K) from (40) fors <r. 
Compute node-chain throughputs, Aj ,(K), s&r, from (39), /€N(s). 
Compute 6;,,(K) from (37) for s Sr, ieN(s) and Bi, (K) from (36). 


If desire Be, (K) or B;, (K), for some s > s, use (34) for B2. (K) and (33) with B,(K) 


given by the sum of BE (K) and Be. (K). 


If do not desire BAK) i >1or Bi (K) > 1,g0 to 2. 


If desire B; (K) J > 1, compute B.(K) from (26) for 1< /$ J. 
If do not want By(K) i > 1,g0 to 2. 

If desire B;),(K*) sSr:if K=K*—Je, initialize Y;,9,(K) = Bjg(K), 2=0,1,...,¥. 

lf K= K*—ne,,n<J Compute Y; ,_, Q s(K) from (42) as shown in Fig. 5. Go to step 2. 


At the desired population K*, if desire By ,(K*) fors>r, initialize 
Yjogs(K*) = BygiK*) L=0,1,...4. Compute ¥; pp 9 s(K*) from (45) 


as shown in Fig. 7. 





Fig. 6—Algorithm for queue size moments—mixed systems. 


We note, however, that the adjustment is not directly related to the 
actual open-chain utilization (as in the state-independent case’’), but 
rather to the limiting open-chain utilization. 

The above algorithm, together with (33) and (34) describing the 
open chains, represents a simple, stable modification to a standard 
mean value analysis for mixed systems with the desired family of state 
dependencies. 

With the node-chain throughputs from (39), the aggregate higher 
order factorial moments can be recursively computed in a numerically 
stable manner from (26). We note that, unlike the mean values, it is 
only necessary to compute higher order moments at those nodes of 
interest. Thus, at the desired node we have the algorithm (see Fig. 6) 


(38) — (40) — (89) — (37) — (86) — (26). 


We now turn our attention to obtaining the higher order moments for 
the lower aggregation level. 

To obtain the higher order factorial moments of the number of each 
type of customer at a node, £;;,(K) defined by (17), we require the joint 
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factorial moments y;;:;(K) defined in (18), noting the relations (19) and 
(20). Treating the mixed system as a limiting closed system, we obtain 


— 


Kis(K) 


+ (1 + 2lai) yi, j-1,25(K — es) + [1 + (1 + aii) (Z — 1) 
+ ain(l — 1)? }yi,;-1-1,(K — es)}; J>0;ssr, (42) 


which is the same form as (21). For / = 0, 7 = 1 (42) reduces to (32). 
For j = 0, use (19) and (26). Making the identification (20), we get 


Nis(K) 
Lis 


Yi,j4s(K) = 





{in yij-1,141,(K = e;) 





Bi,;.s(K) = | Bij-1,s(K = es) 


a Qi Yi, j-1,1,s(K = es) |, sr. (48) 


If we are interested in f;,.,;(K*); s = 7, then (42) and (43) are initialized 
at population K* — Je, with 


¥i.0,1,s(K* — Je;) = Bu(K* = Je;); L= O,1,---, J, S=r (44) 


the high-level aggregation result. Thus the computation proceeds as in 
the closed system case shown in Fig. 5. 

For the open chains, the limiting system argument results in the 
relation 


Vis 
Yi, js(K) = this {ainyi, j-1,1+1,s(K) 


+(1+ 21ai1) yi, j-1,4s(K) + [1 + (1 + ais)(l — 1) 
+ all = 1)*)] vi, j-1,2-1,s(0K)} J > 0, S>T. (45) 


At 1 = 0,7 = 1 (45) reduces to (33) and for 7 = 0, use (19) and (26). We 
note that (45) is not a recursion in K and can be evaluated as an 
aposteriori computation. Thus, if it is desired to compute £;,.7;(K); 
s >r ata fixed desired population vector K*, all that is needed is the 
high-level aggregate B;(K*); 7 = 0, 1, ---, J, which starts off the (45) 
computation. This is shown in Fig. 7 and the steps in the algorithm in 
Fig. 6. As in the previous section, it is a straightforward matter to write 
down the expressions needed to do a mean and variance analysis. 


V. DELAY AND FLOW TIME MOMENTS—CLOSED SYSTEMS 


We consider the problem of obtaining recursions for the moments of 
the time a customer spends at a FCFS, single-server, state-independent 
node embedded in a closed network. As earlier, we define the node 
flow time as the length of time an arriving customer, either arriving 
from another node or being fed back from the node output, spends 
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e TO GET Bids (K*) = YiJos (K*) ;S>r 
* DENOTE Yijgs BY (/,2) 


POPULATION (7,2) 





(1,0) (1,1) (1,2) ee ede (1, J-1) 
| ee se hes 
we SN oo Be 
K* (0, 0) (0, 1) (0,2) © 8% (0,0) eee (0,41) (Oy) 
y 
/ 
/ 


e INITIALIZE WITH HIGH-LEVEL MOMENTS ~— Bo (K*) 


Fig. 7—Schematic for moments of low-level aggregation mixed systems—open chains. 


until he next exits the server. The node delay is correspondingly that 
portion of the flow time spent in the queue. We also present a Little’s 
Law type of relation between moments of the node flow times and 
queue size moments. 
Denoting 7;;,(K) as the 7th moment of the flow time for an arbitrary 
chain s customer at node i, Appendix B shows that 
rR< , 
Tijs(K) = >) eRe) Tij(K — es) + 2 Ti;-1(K); j21, (46) 
l=1 Bi Bi 
where the quantity ij’ is the common mean service time at node i 
(i.e., fi = Bis = pv).’ From the initial conditions with respect to j 


Ti,0,s(K) sr i (47) 
and (46), we have. 


2 
Tijs(€s) = aa (48) 


We note that an alternate form of (46), which only involves the flow 
time moments corresponding to the job type of interest, is obtained by 
recognizing that 


Tii(K es) = Tis(K en e:). 


t Required at a FcFs node to have a product form solution. Since node i has a state- 
independent service rate, pi{k) = 1. 
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It can similarly be shown (using the approach discussed in Appendix 
B) that the 7th moment of the delay distribution corresponding to a 
chain s customer satisfies 


wie 
l=1 i 
Wiy(K am es) t - Wis 1,s(K); J > 1, (49)* 
and 
x ri K — s I K s 
Wis(K) = > ey WaiK — es) += .y  Au(K ~ es) (50) 
l=1 i [i i=1 [hi 
for j = 1. 


As with the flow time moments, the delay moments at a given 
population are formed by updating the same order moment for reduced 
populations, together with including the effect of the lower order 
moment at the same population. We, thus, note that 7th moment of 
the delay and node flow time distributions satisfy the same recursions, 
(46) and (49), for 7 > 1, the difference lying in the boundary equation 
(50), which results in 


Wi. 7,s(ez) = 0; J>0. (51) 


It is not necessary, however, to have a separate computation for 
W,;;(K) since W,;, is obtainable from 7;;, by the relation 
= ri (K _ es) ‘ 
Wis(K) = > ca Ty(K—-es); js>o. (52)' 
[=] i 
We note that the distributions of delays, as seen by arriving cus- 
tomers, will usually differ from the work backlog at a node at an 
arbitrary point in time (i.e., the virtual delay) as might be measured 
by an outside observer. If we define 


Vi;(K) = E(Wi(t); K), (53) 


where W(t) is the work present at node z at an arbitrary time in 
equilibrium, as the jth moment of the virtual delay, then it is shown 
in the next section that V;;(K) can be obtained from the moments of 
customer delay using the relations 


* The computation of these delay moments or the preceding flow time moments are 
easily included in the algorithm shown in Fig. 4. 
‘Note that for 7 = 1, this simply states the known result Wi.(K) = Bii(K — es)/pi. 
Relations (49), (50), and (52) can be written in a form containing only moments 
corresponding to the job type of interest by using W,;,(K — es) = Wijs(K — ez). 
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ae (K) 





Vi;( ) = Wil) += Vy (K); j>1  (54)* 
f=1 i 
and 
Va(K) = ¥ WS (2 Fs WalK) ) (55) 
l=] Bi pi 


Having obtained results both for the queue size factorial moments, 
8:(K), in Section II] and the ordinary moments of the flow time 
distribution 7;;;(K) (see above), we now present a relationship between 
them analogous to a well-known Little’s Law type of relationship for 
the M/G/1 queue in isolation. Appendix B shows that these quantities 
are related via the j-fold summation 

R R 


By(K) = X --+ Y Ais, (K)Ais, (K — @s,) 


$)=1 
= j-1 jJ-1 
: XK =), e, T1,(K -» e. : 
q=1 q=1 
The special, single chain, case R = 1 gives 


K 
by(K) = ( Il Rd JK =7 + 1); (56) 

l=K-j+1 
where A,(Z) is the node i arrival rate when our closed system contains 
Z customers and 7;,;(K — 7 + 1) is the 7th moment of the node i flow 
time when our closed system contains K — 7 + 1 customers. Relation 
(56) is analogous to a known result’ for an M/G/1 queue in isolation 
which states 


Bj = NT;. (57) 


Noting that ’ is replaced by the product of flow rates, each with a 
different population, we see that (56) generalizes (57) in the sense of 
holding for a queue embedded in a closed network with the correspond- 
ing, complicated arrival process. 


VI. DELAY AND FLOW TIME MOMENTS—MIXED SYSTEMS 


To obtain the mixed network results, we consider the augmented 
system as in Section IV. We obtain recursions for the delay and flow 
time moments at a FCFS node with a single state-independent server 


* As pointed out in Section VI, the virtual work results also apply to a limited class 
of processor sharing or LCFS-PR nodes; the limitations being that the service times are 
exponential with pis = wu (i.e., the same assumptions being made for the FcFs nodes). 
Inclusion of this computation in the closed network algorithm shown in Fig. 4 is 
straightforward. 
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embedded in a mixed network. Results are obtained for delays expe- 
rienced by customers in each of the closed chains and for customers in 
any™* of the open chains, thus, yielding moments for the virtual delay 
or unfinished work at a node. 

Denoting p;’ as the common mean service time at node i (i.e., fi; = 
Lis = pit)’ and p? as the actual utilization at node i due to customers 


belonging to open chains, 


Bhs 
ol (58)? 
rc i 
we obtain 
" Aun (K — es) 
Tys(K) = ) = —— Ty (K —- 
oA) = 2 pny 
+27,;.(K) +2 _ 14; (Ke); ss, (59) 
Li ui(l — pi) 


where 7?;(K) is the 7th moment of the flow time experienced by 
customers belonging to any open chain at node 7 and satisfies 


Ni (K) J 
7) K 97 ee el 
mo * ap 


The recursion (60) is initialized with 

pi(1 — p?) [wi(1 — p?)]?’ 

the open network flow time moments, and (59) is initialized with 
Tys(es) = TG(0). (62) 


The initialization (62) can be obtained by recognizing that the distri- 
bution of the number of customers found at node 1 by the single closed 
chain customer is identical to the distribution of the number at node 
it at an arbitrary time point, in equilibrium, in a system with no 
customers belonging to closed chains, i.e., K = 0. Since the distribution 
at an arbitrary point in time is identical with that seen by an arbitrary 
open customer, we have (62). 

The results for the moments of the delay distributions can be found 
using a similar argument. For the closed chains, this results in 


T?(K) = a 7 T?;.(K). (60) 


T3(0) = Ti j;-1(0) = (61) 


* We note that from Ref. 20 it is easy to show that delays experienced by customers 
belonging to different open chains have the same moments. 
Recall that this condition is required at a FCFS node in order to have a product form 
solution. 
* Since node i has a state-independent processing rate, ui(k) = 1 and p? is the actual 
utihzation as indicated. 
§ An alternate way of obtaining (62) is by use of (59) at K = e,, use of (61) and 
induction. 
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c dj K — e, 
Ws(K) = ») pee Wii(K — es) + - Wis i,s(KX) 
l=1 pil — pi) 
Joi We. K - < . 
nL — 00) pai es); ssr,; jJ>1, (63) 


where W? ,(K), the 7th moment of the delay experienced by customers 
belonging to the open chains, is also the 7th moment of the virtual 
delay or unfinished work at node 7 at an arbitrary point in time. The 
corresponding result for 7 = 1 is 
Was(K) = > Sa. (= + Wii(K — e.) 

im. Pi(l — p?) \Bi 


aay sxrjj=l1. (64) 


The moments of the virtual delay, or delays experienced by open 
customers, are given by 





" hn(K 
Woh) = eg i) Wij (K) 
i=1 Pill pi) 
+——" _ W?-(K);  f>1, (65) 
wi(l — p ?) 
and for 7 = 1 
"  An(K | ae 
4(K) = ¥ ——_ (= + Wank) ) + : =<=3 j=l. (66) 
i=. Pi(1 — p?) \pi 1 — pi pi 


We note that for both the open and closed chains, the 7th moment of 
the delay and node flow time distributions satisfy the same recursions, 
(59) and (63), for the closed chains and (60) and (65) for the open 
chains, the difference lying in the boundary equations (64) and (66). 
The initial conditions 


Wii (0) = 0; L=r, 
together with (66) and (65), give 





: oO oO J 
J Pi Pi 
Wi 0 =S: WwW? 1(0) = 7! eer pee re ME (67 
iA ) itz 1 — p? J i ) J ao ) 


the open network result. Using the same argument as was used for the 
closed flow time moments, we have 


Wiyjs(es) = W7(0) (68) 


which can be used to initialize (63) and (64). It is not necessary to have 
a separate computation for W,,;,(K) if the flow time moments have 
been computed since they are related by 
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Wijs(K) = » 


l=] Li 


" Au(K — es 
ee ik — e;) 


+ p?T§(K-—e,); ssr_ (69) 
and 
" Au(K 
y(K) = » “ Tyi(K) + piTy(K);  s>r (70)* 
i=. hi 
for the open chains or virtual work. 


To obtain the moments of virtual delay for closed systems, denoted 
by V;;(K), we observe that 


Vi(K) = lim W3(K), 
pie 
which results in the relations given in (66) and (67). 

As a final note here we observe that in addition to applying to the 
FCFS nodes as stated, the virtual work results, Wj, V., apply to a 
limited class of processor sharing or LCFS-PR nodes!’ the limitation 
being that service times are exponential with rates ui; = win =p; . This 
follows from the insensitivity of the stationary queue size distribution 
to the queueing discipline for this case and the memoryless property 
of the exponential distribution. 


Vil. DELAY DISTRIBUTIONS 


We consider a multi-job-type closed? network containing either 
state-independent service centers or infinite server nodes, e.g., central 
server model of multiprogramming, and present results for the tail of 
the delay distribution experienced by a type s job arrival to a FCFS 
service center embedded in a product-form network. Appendix C, 
which uses multidimensional generating functions and the known 
relation between the stationary distributions and those seen by cus- 
tomer arrivals to a node,”' contains the details of the investigation. 

We denote 


Wis(t; K) = P(dis > t; K) (71) 


as the probability that a chain s arrival to node z is delayed in excess 
of ¢ in a system with population K. Figure 8 shows a single-chain 
example where the delay shown corresponds to the time between a 


* The inclusion of the results of this section into the mixed network algorithm shown 
in Fig. 6 is straightforward. 

‘Or other types not affecting the stationary queue-size distributions, e.g., random 
are or LCFS nonpreemptive. The identification as actual delay moments is, however, 
ost. 

*It is a straightforward matter to treat the limiting mixed network, as well as a class 
of state-dependent nodes (e.g., multiprocessor nodes) by the techniques presented. 
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TERMINALS 
(NODE ™) I/O PROCESSORS 





K TERMINALS 

M, = M-1SINGLE-SERVER NODES 
NODE M INFINITE-SERVER NODE Jy, = 
W(t; K) =P{d,> t;K} 


1 
MEAN THINK TIME 


Fig. 8—Closed system illustration—delay distribution. 


terminal request and the time the request first gets the attention of 
the CPU in a system with K terminals. References 22 and 23 contain a 
study of the response time distribution (queueing and service) for a 
single node being fed traffic from a collection of terminals, the classical 
machine repair problem with multiple repairmen. This could corre- 
spond to a multiprocessor version of Fig. 8, but without the 1/o 
processors. In Ref. 24 the asymptotic behavior, as the number of 
terminals increases, is studied. We note that the methods described in 
this paper can easily be used to obtain results for the response time 
distribution (delay plus service time) for the above example.* In 
general, the state of the art for obtaining response time distributions 
for multiple resource systems is quite limited! 

Figure 9 shows a representation of two loosely coupled systems with 
shared mass storage devices modeled as two central server models 
with some shared I/o queues.” For 1/0 requests served on a FCFs basis 
in each 1/o queue, the delay distribution of interest shown corresponds 
to the delay in accessing a disk, either dedicated or shared, for each of 
the component systems and to the delay in getting each of the cPpU’s 
for FCFs scheduling algorithms. We note that the population vector 
here, K = (Ki, Kz), could correspond to the degree of multiprogram- 
ming for each component system. 

We start our presentation of results with the single-chain case and 
then generalize to the multichain case. For the single-chain case, 
letting 


Wi(t; K) = P(d;>t; K), (72) 


* The single node being a limited queue-dependent server.” 

* We note that these response times could involve several visits to a given resource, 
as well as visits to other resources. This is studied in Ref. 25 for the single resource 
queue with feedback. Some other recent work is discussed in Section VIIT.?627.8 
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DEDICATED 
1/0 DEVICES 
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Fig. 9—Two systems sharing mass storage devices. 


it is shown in Appendix C that W,(t; K) satisfies the Mi order 


recursion 
M, 


Wi(t; K) = y(t; K-1)- Y @(K-1)Wilt; K-7), — (78) 
j=l 
where M, = M is the number of single-server nodes in the network. 
The coefficients a;(K — 1) are related to the coefficients of Z’, a;, in 


My, Xe My : 
P(Z) =T] (: -*z) = ¥ a7, (74)* 
j=0 


7=1] 
where J; is the relative arrival rate to node 1. 
Clearly, 


aj = (—1)/ > Pi,Pi, *** Pi, (75) 


Isti<tg< ++. <tj=M 


where pi, is the relative utilization \;/u:,. The coefficients of (73) 
a;(K — 1) are obtained from 


* We have arbitrarily labeled nodes 1 through M, to correspond to the single-server 
nodes. Since these nodes are state-independent service centers, in this section we denote 
the common mean service time at node i as pj', i = 1, 2, ---, Mi. 
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aj(K — 1) = (—1)’ b3 pi, (K — 1) 


<< ---<yjsM) 
-pi(K — 2) ++» pi(K—j), (76) 


which corresponds to (75) with the actual utilizations 
~ di, (K — 7) 


yj 


pi (K — j) (77)* 
evaluated at the appropriate population replacing the relative uti- 
lizations pi, Recall that the actual utilizations are available via 
standard analysis. The forcing function on difference equation (73), 
yi(t; K — 1), can be obtained recursively (see Appendix C) from 





pit _ Aco 
(t; K) = i(K){1+—] y(t; K -—1 78 
vi ) = a ( =) ) (78) 
and the initial condition 
yi(t; 1) = ep, (1). (79) 


The quantity a. represents the relative loading on all the infinite- 
server nodes 


s NN 


j=M,4+1 Wy 


is = (80) 


We note that for the network example of Fig. 8, 





yilt; K) = pi(K)(pr + wat) y(t; K — 1), 


1 
K-1 
where p is the fraction of cpU requests feedback to the terminals, 

mean think time 
= Se. Janie ta eee ee 
mean CPU service time 
and pit represents the point at which we are evaluating the tail of the 
delay distribution in units of CPU service times. 

For the multi-job-type networks, the tail of the node 7 delay distri- 
bution as seen by a chain s arrival is shown in Appendix C to satisfy 
the multidimensional recursion 


Wis (t; K) = yi lt; K — e,) — >» jy, jg,--- irk — es) 


(O<jit jot: +++ jrsM}) 
; W;.(t; K — jie: — jo@2 — ++ — Jrer). (81) 


The quantities qj,,...,j,(K — es) are the multidimensional analogues of 
aj;(K — 1) in (76), ie., they are related to the coefficients of Z+!- 


* Here A;(K) denotes the actual arrival rate to node i for a system with population K. 
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J J _ 4 
Zz Pre LE, Oy .+++ JR in 


ae = Nis 
i=] s=1 is 
Analogous to (76), we have 
Gij,..jg(K — @s) = (—1)VR 
{tpg}EL 
= (ji =~ 1)e,) | rs An [Pi,,;R(K — @;s — Jie 
rar 88 JR-1€R-1) eas Pi, pr (K = C= Jie€i = eas —(Jr a 1)er)]} : 


(82) 


(Li, 31K — es) +++ By, aK - e, 


where 


R 
r=1 


and {l,,} € L corresponds to 


li) < lg) Sees < Uj 

UR < lor < +++ Lyk 
1.€., Qj,,j9,---jnUK — @s) 1s a sum of products of utilizations with p;,_;, 
evaluated at the population (K — e, — j1@1 — je@2 — +++ Jp-1@r-1 — 


(p — le,). The forcing function is obtained recursively from 


(Qo (KK) = Ais(K)¢) yi (t; K _ e,); 


SEAR), sER*(t), K-=OVrE &* (i), 
vil; K) — K.6(K a e,) vill; K es); (85) 


sE Ai), K-=O0OVr€é &* (i), d5(K —e,) > 0" 

QO; K,>0 forsome ré&*(z), 

with initial condition 
oe ne | prleye rE BRi(i) 

Yilb; er) = f ré Ri). 29) 

Rt) denotes the set of chains passing through node 1, and & *(z) 


denotes the set of chains which either pass through node i or through 
an infinite-server node. The quantity 


s(K)= Y K, 


ana er (87) 
rER (t) pa 
5) 


TIf 6(K — e,) = 0 and K — e, ¥ 0 [if K — e, = 0, use initial condition (86)] y(t; K) can 
be computed from the first relation for some s € #(i), s € R*(i) corresponding to 
K; > 0. 
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the relative chain r loading on the infinite-server nodes 


M \. 
C= —, (88) 
J=M,+1 [jr 
and the actual chain r infinite-server loading 
Me\7r(K) oe 
a,.(K)= y — ee Ar (K) Gree « (89) 





J=M,+1 yr 


We note that it is only necessary to compute y;(t, K) in the subspace 
spanned by chains passing through node z or any infinite-server node. 
For the special case where M; = M (e.g., the central server model) 
A(t) =A*(t) and (85) becomes 





wit Kl Ky y(t, K—e)): 
K, \K| _] pis(K) yi(, K es); 
y(t, K)=2 s€A(i), K-=0VrER(i),|K|>1, (90) 


0; K->0O forsomer € &(t), 


where 


R 
IK] = 5 K,. (91) 


Vill. SUMMARY 


This paper has presented contributions to the foundations of a tool 
to support performance analysis and modeling activities aimed at 
answering some key questions at various stages of a computer system’s 
life cycle. The emphasis here has been on presenting easily and 
efficiently computable results for calculating distributional information 
and a stable, efficient method for dealing with congestion adaptive 
devices.' Mixed systems have been considered to allow us the gener- 
ality of dealing with traffic sources which are fundamentally different 
in their behavior. By obtaining results for different levels of customer 
aggregation, we allow one to consider a macroscopic or more micro- 
scopic level of detail. The virtual delay results allow us to quantify 
differences between service as perceived by an arriving customer and 
that perceived by a measuring device. 

We note that many open questions exist in the areas of obtaining 
results related to the distribution of total time a customer spends in a 
subnetwork consisting of several nodes with feedback (e.g., Fig. 1, 


"In a recent paper,’ a modification of mean value analysis is introduced to eliminate 
numerical instabilities when dealing with general state-dependent service rates. The 
method involves analysis of complementary systems and evaluation of marginal queue 
size distributions. 
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where the time of interest is the response time).* This is an area of 
active research in the literature (e.g., see Refs. 26 and 28); however, 
the results available are fairly restrictive with respect to the network 
topology’ or customer paths, and do not apply to, for example, the 
network of Fig. 1. Approximating the moments of the distribution of 
the overall time to transit a network from the individual node flow 
time moments Is one possible approach which would have to be studied 
and evaluated. The problem arises because of statistical dependence 
of a given customer’s flow times as he sojourns the network. For open 
Jackson networks, Reiman® uses a heavy traffic limit theorem to 
obtain a diffusion approximation for the network sojourn times. An- 
other area of importance relates to the inclusion of priorities in, for 
example, the cpu schedule. We note that an approximation technique 
(based on utilization adjustments) does exist’ for handling a class of 
priority disciplines and can perform quite satisfactorily in many 
cases." The approximation is such that it enables us to compute 
performance measures using results in this paper. 


APPENDIX A 
Recursions for Queue Size Factorial Moments 


For closed systems, it is known’ that the marginal, stationary prob- 
ability distributions satisfy 
1 nl Vis (K) 
(ki = k; K) = —— 
= pik) 2 Lis 





-p(ki= k — 1; K — eg); k>0O. (92) 


While conceptually, the desired moments could be computed from 
recursively computed marginal probability distributions, it is not rec- 
ommended. Other, more computable, approaches could involve the 
use of generating functions.” Our approach is to directly obtain a 
recursive relation for the moments. We use (92), (8), and (9) to obtain 
(10), with the indicated initial conditions in a straightforward manner. 
We obtain the required node-chain throughputs, A;,(K), by considering 
the lower level aggregation 


* Or, for that matter, at a processor sharing node embedded in a general closed 
network. The waiting time distribution for a specific closed network consisting of a single 
processor sharing node fed by a single finite population class is treated in Ref. 31. 

In Ref. 27 a computational methodology is given for obtaining upper and lower 
bounds where an arbitrary network topology is allowed. When applied to two M/M/1 
queues in tandem (for which the exact solution is known) the upper and lower bounds 
are close; however, over 30,000 states were used in the computation at 80 percent 
occupancy. Computational aspects are presented in Ref. 32. 

* Reference 34, which obtains a closed form solution for a two-node closed network 
with priorities, proposes a criterion under which the approximation technique would be 
expected to perform well. 


738 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


ki = (Ra, Riz, «++ , Riz), 


where kj, represents the number of chain r customers at node 1. 
Denoting k; as the total number of customers at node i, we can write 
the recursive relation 


hy Xis(K) 
Ris pi (Ri) pis 


which is obtainable from the product form solution. From (93), (8), the 
standard Little’s law argument at each node and about the entire 
system, and the irreducibility of the routing chains we get (12) through 
(15). To obtain the desired recursion for the joint factorial moments, 
we use (8) and (93) in (18), make the appropriate identification corre- 
sponding to a system with reduced population and obtain (21). 

The mixed system result (26) is obtained directly from (10) by 
considering the augmented system with population 


K’ = (K| A431, enon: Kr), 


pilks; K) = pilki—es;sK—es);  Ris>0, (93) 


denoting 


By(K)= lim By(K!) 


and decomposing the sum over the open and closed chains. The lower 
level aggregation results are obtained by use of 


ie (k; — ef; K) > 
— —_—_—- p(k; — e{; K); s>r 
Ris wilRi) pis f 
lim pilki; K’) = (94) 
Kya +++, KR 0 a 
ki Xis(K) (k— ef: Ke.) e 
>; 7,7 . — fi — @s; —€@s ; s= r, 
Ris wi(Ri) pis - 


where e; is an R-dimensional unit vector in direction s as opposed to 
e;, the corresponding r dimension vector for s =r. 


APPENDIX B 
Recursions for Node Delay and Flow Time Moments 


Denoting W,,(¢, K) as the complementary delay distribution expe- 
rienced by a chain s customer at node z when the system population is 
K, we can write 

2 |K]—1 (wist)* 

Wi.(¢, K) = Tener 

4 te 

where jis = pj at the FCFS state-independent node under consideration, 
k;(tis) represents the number of customers at node z seen by a chain s 


e“'p (Ri{tis) = Rk; K), (95) 
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arrival to node i, and |K| = K, + Ko + --- + Kr. For the class of 
closed* systems, we are initially considering 


D(Riltis) = k; K) = p(ki = k; K — e,), (96) 


i.e., the distribution as seen by an arriving chain s customer Is equal to 
the distribution at an arbitrary point in time in equilibrium (i.e., the 
stationary distribution) for a system with one less chain s customer.” 
Using (96) in (95), we can obtain the Laplace-Stieltjes transform of the 
flow time distribution for node-chain pair (1, s) 
k+1 
yen 





. 7 |K|-1 
Tis (n, K) = rors by p(k: = k; K — es) ( 


ve Mi k=1 7+ Bi 


The jth moment, obtained 4 differentiation, satisfies 


Ti(K) = 2 i,J- AK) + vy “h(k +1) - -(k+ 7 — 2) 


pt k=] 
p(ki=k;K-—es), (98) 


and the summation in (98) can be written as 


|K[-2 w(K 
2 (q+1)--- (q+ 7-1) aaa 


-pikii=q;K-—e:—es), (99) 


where we have used (92) to get the one-step recursion on the tail of 
the marginal queue size distribution. Inserting (99) in (98) and making 
the appropriate identification, we get (46), with initial and boundary 
conditions as indicated. 
To obtain the Little’s Law type moment relation, we use the defi- 
nition of £;(K) and summation by parts to get 
[K|-j 


By(K)=7 Y +10 +2) --- 0+ 7-Vp(kizl+ 7; K), (100) 
l=0 


which upon repeated application of (92) results in 


R R jJ-1 
Bi (K) = > wren 2 Ais, (K)Ais,(K _ e;,) ke Ais, (K = DH e., 


$)=1 
ed 


ne y (I+1)- ++ j- Dp (k= EK e,), (101) 


Identifying the last term, we obtain 


* The mixed system results obtained by the usual limiting system argument are 
reported in Section VI. 
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He 


1 j 


Ss 1 


jJ-1 
Nis, (K)Ais,(K — @s,) +++ Nis, (K =e} e, 


q=1 


R 
By (K) = » 


j-l 
x Tis, (K- y e.) (102) 


q=1 


which for the special, single chain, case R = 1 gives the moment 
relation 


K 
By(K) = ( II nt) Ty(K —j+1). (103) 
l=K—j+1 

APPENDIX C 
Recursions for the Tail of the Node Delay Distributions 

We consider a closed system of M, state-independent, single-server 
nodes and M — M, infinite-server nodes and obtain an M‘" order 
difference equation for the tail of the customer delay distribution at a 
FCFS node. Denoting 


Wis(t; K) = P(dis > t: K) 


as the probability that a chain s arrival to node 7 is delayed in excess 
of ¢ in a system with population K, we can write 


a |K|-—1 (uit)* 
Wis(t;K) = >) ~—_e*ip(ki= k; K — e,). (104) 
r= (Rk — 1)! 


We denote the product-form normalization constant” as 
G(K) = p» £1 (ki) go(Ke) +--+ gu(Ku), 


Sk=K 
where 
kl! /Aa\" (Av 
g;(k,) oe ha ) dhas (2) 
gle oe FR. \ fj LR 
j= 1, 2, yen ,M,, 
Cmcr 
By LyR . 
gj (kj) eco J=M,+1,---M, (105) 
it +++ Rr! 


and we have labeled nodes 7 = 1, --- , M. as the single-server nodes. 
Multiplying (104) by G(K — e,) and obtaining the generating function, 
we have 


mi Our final result will not involve computation of the normalization constant, the 
calculation of which can result in numerical problems. 
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R . 
Si(Z) ( 5 ~ Z,) 


Lis(Z) = ee 
1- Y —Z, 
( Py bi 


a Nir 
exp| —ne ( -» 2.) |, (106) 
r=1 pi 


where 
Li(Z) = Ye Y Wit; K+ e)G(K)ZP «+» ZR, (107) 
K,=0 Kp=0 
and 
S(Z)= Yous VY G(K)ZE --- ZR, (108) 
K,=0 Kp=0 


where &;(K) is the normalization constant for a reduced network 
(node z absent) 


G(K)= ) gi(ki) «++ gi-a(Ki-1) Bini (kisi) +--+ gr(Kr). (109) 


y k=K 
ji 


Inserting 
Si(Z) = TT $;(Z), (110) 
j#i 


where 


1 
$$ j= Mn, 
R jr J 
1 a, > — Z, 
_ r=1 Lyr 
S)(Z) = 4 (111) 
» wT 2, 
evalnir j>M. 


into (106), we get 


R 
a Ne. Y (arethirt)Z, 
P(Z)Lis(Z) =e’ | > ra e” 4 Ait, Z), (112) 
r=] Pi 
where P(Z) is the polynomial given by (82) and a,.. given by (88). 
Inversion of (112) and division by the appropriate normalization con- 
stant results in 


M, M, Ii(K — ee, — jie: — +++ — jrer) 
2 ei ee 
6 joan G(K — es) 
Tite + JREM, 
_Ai(t,K-es) , 0, 
~ G(K-e,) ~ a 
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where aij,,...jz is the coefficient of ZZ? --- ZF in the polynomial (82), 
h;(t; K) is the inverse generating function of H;(t; Z) given by 


h(t, K) = e** I aad Vir-K, 


————;;_ (114) 
rER*(i) K;! rE AR(i) [i (Aree - Ait) 


A(t) denotes the set of chains passing through node z and &*(z) 
denotes the set of chains which either pass through node i or an 
infinite server node. The quantity /;,(K) in (113) satisfies 


l.(K) = Wi.(t, K + e.)G(K). (115) 


As it stands (113) represents an M‘ order recursion for the distribution 
tail; however, the coefficient and forcing function involve the normal- 
ization constant. Upon use of (115) in (113), and recognizing that 
G(K — e, — J1€1 — +++ — Jrer) 
G(K = e;) 


can be written as a product of node throughput proportionality con- 
stants 


i Ai(K — es — hes | i Ao(K = Jie1— bes) | cee 


1,=0 1,=0 


Jr71 | 
i Ar(K — es — Jie: — +++ — Jr-1€Rr-1 — nen | ) 
Ra=O 
we obtain (81) and (83). 

The recursions (85) for the forcing function 


: = h,(t, K) 
yi(t; K) = “G(K) (116) 


follow from 

(Aso + Aist) 8(K) 
K, 6(K—e,) 

where 6(K) is given by (87), and from 

(Asoo + Aist) 
Ks 


h;(t, K) = hi(t, K — es); sE€ A(t), (117) 


hi(t, K) = hi(t, K — es); sEA(i)sEAk*(t). (118) 
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Adaptive Intra-Interframe DPCM Coder 


By P. PIRSCH 
(Manuscript received August 4, 1981) 


Adaptive prediction schemes provide lower transmission rates than 
those obtained by simple previous frame prediction. In this paper, we 
measure the entropy of prediction errors for two types of adaptive 
intra-interframe prediction algorithms. In the first case, that predic- 
tor which results in the least prediction error for previously trans- 
mitted neighboring pels is selected from a set of predictor functions. 
In the second case, prediction is a weighted sum of previous frame 
and intraframe predictions, where the weights are changed from pel 
to pel by gradient techniques. We also investigate various modifica- 
tions of the basic methods. Further, a new type of variable length 
encoding in which the locations of the nonzero prediction errors are 
coded by horizontal run lengths is discussed. Compared with the pel 
entropy of previous frame prediction, the run length coding gives a 
gain of 2 to 16 percent, depending on the scene. Compared to simple 
previous frame prediction the first type of adaptive scheme in com- 
bination with horizontal run length coding provides a gain in entropy 
of 18 to 29 percent, whereas the second type of adaptive scheme 
provides a gain of 20 to 32 percent. 


I. INTRODUCTION 


The bit rate required for digital transmission of television pictures 
can be significantly reduced by interframe DPCM encoding. The coding 
method which has been widely proposed for video-telephone and 
video-conference application is conditional replenishment.” In condi- 
tional replenishment, each frame of a television sequence is segmented 
into changed and unchanged areas. Various methods can be used for 
encoding the changed parts of a frame. Intraframe predictive coding is 
very efficient for these parts.** In conditional replenishment, no infor- 
mation about the unchanged areas is transmitted. At the receiver, the 
unchanged areas are reconstructed by repeating from the previous 
frame. However, it is necessary to transmit address information that 


147 


indicates the location of the changed areas. Several modifications and 
improvements of the basic method of conditional replenishment have 
been made. Most by them are described in a survey by Haskell.” 

This paper describes adaptive intra-interframe prediction. It is ob- 
vious that stationary background of a frame is best predicted from a 
pel in the previous frame which has the same position as the pel to be 
predicted, whereas parts of a frame with moving objects are better 
predicted by an intraframe predictor. Therefore, a prediction scheme 
which provides automatic switching between the two types of predic- 
tors, depending upon the part of the picture, will result in lower bit 
rates. To avoid the transmission of additional predictor control infor- 
mation, the adaptive prediction schemes described here are based on 
previously transmitted reconstructed pels. Further, no forward seg- 
menter like that of conditional replenishment is used. Therefore, only 
the quantized prediction error has to be coded and transmitted. 

A block diagram of such a DPCM encoder with adaptive prediction is 
shown in Fig. 1. The investigations in this paper concern a comparison 
of the performance of two types of adaptive predictors. The first one, 
denoted by predictor selection, is a scheme where one predictor is 
selected from a set of predictors. In the second scheme, the predictor 
is a weighted sum of predictors and the prediction coefficients are 
changed continuously by a gradient algorithm. As a measure of pre- 
dictor performance, the entropy of the quantized prediction error is 
used. For three different television scenes an estimate of the entropy 
is obtained from DPCM simulations. The necessary measures against 
buffer overflow and underflow, in case of variable length encoding, 
have not been considered here. 

This paper is organized as follows. Section II gives a detailed 
description of the two basic algorithms and their modifications. Section 
III describes a variable length encoding scheme which is especially 
suited for Dpcm coders that have improved prediction. Results of 
simulations on real scenes are given in Section IV. 
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Fig. 1—Block diagram of a ppcm coder with adaptive predictor. 
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Il. DESCRIPTION OF THE PREDICTION ALGORITHMS 


Let f; be one of M predictor functions. If each f; is a linear predictor 


function, then 
N 


fi = » AS}, (1) 
jJ=1 

where aj; are the weighting coefficients and s; are previously transmit- 

ted pels. The prime in s; indicates that these are reconstructed pels 

which are known at the receiver. The subscripting for pels neighboring 

the present pel sp is shown in Fig. 2. The predictor functions f;, i = 1, 

2, --» M, are linear combinations of N pels, s;, 7 = 1, 2, --- N, which 


form a vector 
Ss 


s’ = : (2) 


SN 


No~ es 


In vector notation, equation (1) can be written as 
fi=ajs’. (3) 


Here the superscript 7’ denotes the transpose of a vector or matrix, 
and a; is the vector formed by the coefficients aj, 7 = 1, 2, --- N. The 
prediction value > is a weighted sum of all predictor functions, 


M 
fo =) bifi. (4) 


If f denotes the vector of elements f;, 7 = 1, 2, --- M, and b denotes the 
vector of elements b;, i = 1, 2, --- M, then 


So = b’f. (5) 





PRESENT FRAME PREVIOUS FRAME 


Fig. 2—Configuration and subscripting of picture elements. Pel so is the present pel 
to be predicted. Dotted lines denote scan lines from previous fields. 
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This description is general and includes switched prediction by allow- 
ing special values of b, such as 6; = 1 and b; = 0 for all 1 4 k. Combining 
(3) and (5) it follows that 


f = As’ (6) 
So = b’ Ag’, (7) 
where 
ay 
ee ed (8) 
aM 


is an M X N matrix. The set of predictor functions is described by the 
matrix A, with the coefficient vectors a; chosen such that a particular 
predictor function provides a good prediction for a specific area of a 
television scene, like stationary background, moving objects, etc. The 
algorithm then seeks to automatically adapt the vector b to various 
areas of a scene so as to minimize the prediction error. 

In this investigation, the set of predictor functions is restricted to a 
previous frame predictor 


fi = $20 (9) 
and an intraframe predictor 
fo = A218) + A2282 + a2383. (10) 


The following prediction algorithms are described for two predictor 
functions, but they can easily extend to more than two. 


2.1 Predictor selection schemes 


From a set of predictor functions, the predictor which results in the 
least prediction error for previously transmitted neighboring pels is 
selected as the predictor for the present pel. For each predictor 
function, a decision function u; is defined, which is the sum of the 
amount of the prediction errors for each pel in a small window of 
neighboring pels. The predictor which has the smallest value for the 
decision function is chosen as predictor. This criterion was also used 
by Stuller et al.° for gain and displacement compensation. The basic 
selection rule for two predictor functions is as follows: 


et hi if U, = Ue 
mat if m> tm, (11) 


where 


ui= ¥ [sr — fils:)|. (12) 
keW 
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The subscript 2 denotes a pel in a neighborhood W. The window W is 
chosen such that s? is known at the receiver. The decision function wu; 
can be evaluated at the receiver without transmission of additional 
information about predictor selection. For previously transmitted near- 
est neighbors, the index set W is 


W. = {1, 2, 3, 4}. (13) 


For real-time implementation the choice of W, creates problems, 
because of the use of the pel si. The time constraint for calculation of 
u; can be reduced by using the index set 


We = {2, 3, 4, 5} (14) 
or 
W, = {2, 3, 4} (15) 
instead of W.,. The window W, is also used by Stuller et al.° 
A further simplification for hardware implementation can be ob- 


tained by introducing a quantizer function Q,[°] in (12). Then the 
decision functions u; are given by 


ui = a Qs [sk — filsz) |]. (16) 


A modification which leads to a simpler implementation than the basic 
selection rule (11) can be described as follows. Choose the predictor 
function f; which has within a window W most frequent minimum 
magnitude of the difference 


din = Sk —- fi(sz). (17) 


In the case of two predictor functions at each position k, a binary 
variable uv, which describes which predictor function is better, is 
defined as follows, 


1 | diz | = | doz | 
= 18 
— {0 |diz| > | doz]. ( ) 
The decision functions u; are now given by 
Ui = > Up 
heW 
u2 = > Up, (19) 
keW 


where v; is the complement of v;. The predictor with smallest value u; 
is chosen. The selection rules as discussed above require that one 
predictor function be chosen even if both decision functions wu; are 
identical. An improvement can be obtained by using a “soft-predictor 
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switch,” 1.e., the prediction value is a weighted sum of predictor 
functions, as given by (4), with the weights 6; being proportional to the 
frequency of preference of the predictor function f;. Hence, for two 
predictor functions, 


I 
b= U 
7 ey : 
1 = 
b. =— », Uk, (20) 
N kew 


where 7 is the number of pels in the window W. To avoid the division 
by 3, for W = W, the contribution of the pel at position 3 to (20) is 
doubled, and n is chosen to be 4 for this special case. 


2.2 Adaptive prediction based on a steepest descent method 


The steepest descent’ is a mathematical method which has been 
often used for optimization. One advantage of this method is its 
simplicity. This method has been used frequently for adaptive systems. 
It is also proposed by Netravali and Robbins® and Stuller et al.® for 
motion-compensated prediction. Here it will be applied to adaptive 
intra-interframe prediction. 

Let us assume that the prediction value is a weighted sum of 
predictor functions as given by (5). Then the prediction error is given 
by 


e=s—b’f. (21) 


In the following, the present pel is denoted by s, rather than so. The ~ 
variance of the prediction error e is a quadratic function in b. 


F(b) = E[(s — b*f)’), (22) 


where E[*] is the expected value. The gradient with respect to b is 
given by 


g = VbF(b) = —2E[(s — b’f)f] (23) 
= —2E[ef |. 


The steepest descent is an iterative method, where starting from an 
initial guess the vector b is modified recursively according to, 


beth = pb” — yi) ge) | (24) 


The adjustment of the vector b™ is made in the direction of the 
negative gradient. The scalar 7) has to be optimized by a one- 
dimensional search scheme at each step m. However, real-time appli- 
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cations are performed with a constant value of y. The best value of y 
depends on the type of data. In addition, the value of y influences the 
stability and the speed of convergence of b. 

From eqs. (23) and (24), it follows that an adaptive prediction 
scheme which, based on a gradient method, is given by 


bi) = b™ + IF wlef]™, (25) 


where £,,{° ] 1s the expected value within a small window of neighboring 
pels as given by (13), (14), or (15). The coefficient vector b is updated 
on a pel-by-pel basis along the scanning direction, i.e., if b°*” is the 
coefficient vector at the present pel, b™ is the coefficient vector at the 
previous pel. At the beginning of each line, an initial estimate of b is 
used, e.g., the mean of b at the previous line. Simulations indicate that 
because of a fast adjustment an initial vector b with elements 6; = 
1/M, i = 1, 2, --- M is appropriate. 

In this study, several modifications of the recursion given by (25) 
have been investigated. The various algorithms will be compared with 
respect to prediction gain and cost of implementation. A high predic- 
tion gain requires an appropriate value of y in (25). Simulations with 
several values of y indicate that for video signals with normalized range 
[O, 1] the optimum value of y 1s about one. In such a case the 
adjustment from pel to pel is relatively small, and the transition from 
one predictor function to another takes several pels. By introducing an 
additional constraint 


M : 
eae (26) 
jJ=1 


the value of optimum j is increased to about 64. The increased value 
of y provides a shorter transition from one predictor function to 
another and the constraint (26) improves the stability of the algorithm. 

With the constraint of (26), the steepest descent method has to be 
modified to minimize the augmented function of (22) 


O(b, A) = E[(s — b’f)?] + A(b70 — 1), (27) 


where o is a vector with all elements equal to 1. The coefficient vector 
b is updated recursively by 


boty = b”™ = y(—2E[ef]™ a Ao). (28) 


Using (26) to eliminate \°” from (28), and replacing E[*] by Ew], 
then 


b+) = bh™ 4 27 C Eyfef], (29) 


where C is an M X M matrix given by 
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,2 _3 il 
M M M 
1 1 1 

CH 1f=< 56° = S30, iste ey ee 
U oo 7 1 7; 7; (30) 

ee Sere 

M M M 

and U is the unit matrix. Because of (26), it follows that 
s — b’f = b7(so — f) = bd, (31) 


where d is a vector of differences similar to (17). This leads to an 
equivalent recursion of (29), given by 


bi) = b™ — 27CEyfed]. (32) 


In the recursions given above, the coefficient vector b at the previous 
pel is updated by an adjustment to obtain the coefficient vector at the 
present pel. However, a picture is two-dimensional in nature, the 
values of b for pels from the previous line in the immediate neighbor- 
hood of the present pel are quite close to that of the present pel. This 
idea results in a modification of (25) which is given below. 


be) = Ew{b]” + 2y Evy({ef]”. (33) 


Let us assume that the samples s and the predictor functions /; are 
represented by 8 bits. In such cases, in the recursions given above at 
each position within the window, a product of two 8-bit numbers has 
to be calculated. A reduction in the cost of implementation can be 
achieved by using the three-level quantizer, shown in Fig. 3, for the 
prediction error e and the differences d. These investigations show 
that a three-level quantizer with a dead zone is more advantageous 
than the signum function used by Netravali and Robbins.® 

The algorithm (29) and (83) for the case of two predictor functions, 
in combination with a three-level quantizer Qp, results in the following 
recursive scheme, 


by"? = Ew[bi] + yEwl@ole)Qo(fi — fr)" 
by) = Ew be] — yEw{Qn(e)Qo(f — fr], (34) 
with the constraints 
b) + bg = 1 
0= db 
0 = bp. (35) 
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Fig. 3—Three-level quantizer for gradient quantization. 


The latter two constraints of (35) were introduced to avoid negative 
weighting coefficients. 


lll. VARIABLE LENGTH ENCODING BY HORIZONTAL RUN LENGTH 


An adaptive prediction scheme leads to many predictable pels. A 
pel is described as predictable if its quantized prediction error is 
represented by the level zero. To obtain a low transmission rate, the 
quantized prediction error is coded by a variable length code. There is 
always a loss in mean transmission rate compared to the entropy if not 
all of the negative logarithm of the probability of the prediction error 
representative levels are integer. This loss is especially high if one level 
has a probability much larger than 0.5. For adaptive prediction 
schemes, this is true for the quantizer level zero. To overcome this 
problem, block coding is frequently used. For the application described, 
a special coding scheme is proposed. 

From each frame, a two-level picture is generated which indicates 
where the pels with zero code words (zcw) and where the pels with 
nonzero code words (NZCW) are located. This new picture can be coded 
by known one-dimensional and two-dimensional coding techniques for 
two-level pictures. The Nzcws are coded in parallel by a variable- 
length code like a Huffman code and multiplexed with the code words 
of the two-level picture such that the receiver can decide between the 
two types of data. A block diagram of such a coder is shown in Fig. 4. 

For a horizontal run length code, the set of symbols to be coded is 
listed in Fig. 5. For each of the sets, 1.e., zero runs (ZR), nonzero runs 
(NZR) and nonzero code words (NZCW), a variable length code can be 
determined independently and matched to the probability of the 
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DPCM VARIABLE LENGTH ENCODER 
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Fig. 4—Block diagram of a new type of variable length encoding. 


REGULAR SET OF CODE WORDS 


{0,1,2,..+,k} 


NEW SETS FOR CODING 


{i) Set of nonzero code words (NZCW) 


(172,seagkt 


(ii} Set of zero runs (ZR) 


i ZR 

0 1 

1 01 

2 001 

3 0001 

n oo00...01 
nti 0000... 00 


ii) Set of nonzero runs (NZR) 


i NZR 

0 0 

1 10 

2 110 

3 1110 

m 1111... 10 
m+1 1111...11 


Fig. 5—Set of symbols for horizontal run length coding. 


symbols of that particular set (e.g., Huffman code). The type of runs 
are chosen so as to allow a wrap-around coding from line to line. Wrap- 
around coding means that a run is not terminated at the end of a line 
but continued in the next line. Furthermore, the longest run to be 
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coded could be shorter than one line. The code words must be trans- 
mitted in a sequence so that the receiver always knows which code 
table must be used for decoding. Fig. 6 gives an example in which a ZR 
is transmitted at the beginning of a line. In this example, it is also 
assumed that the Nzcws are transmitted just after the corresponding 
run. 

The entropy 


H=- bD Di log pi (36) 


is used as an estimate for the mean code word length, with p; being the 
relative frequency of the ith code word derived from the DPCM simu- 
lation of a TV sequence. The variable length code described above 
consists of three independent codes. Hence, the entropy Hrun in bits 
per sample is given by 


MNZCW Nz nn 


R 
Herun = Hyzcw + Az + 
NPEL MPEL 











= H. NZR, (37) 
NPEL 
where 7 is the number of events specified by the subscript. 

An advantage of the type of run length coding presented here is that 
in the case of statistically independent symbols, the overall entropy is 
not changed (Hpre, = Hun). In the case of interframe coding, the zeros 
and nonzeros are grouped together because they are related to the 
picture content. In this case, a decrease in entropy is achieved by the 
horizontal run length coding. 


IV. SIMULATION RESULTS 


Computer simulations were performed for the prediction algorithms 
given above using three different television sequences. These se- 
quences are the same as those used in Refs. 6 and 8. Each sequence 


LINE OF INPUT CODE WORDS 


0004560320000040000 0 


BINARY ZERO-NONZERO PATTERN 


poo jis of{s fooo0ojoo0o} m | 
fe 


oo | | | 
ZR 3 0 4 4 
NzR_ | [| 2 Ila lol | 


— 


POSSIBLE CODE STRING 
ZR3, NZR2, CW4, CW5, CW6, ZRO, NZR1, CW3, CW2, ZR4, NZR@, CW4, ZR4 


Fig. 6—Example of a horizontal run length code. 
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consists of 60 frames obtained by sampling a video signal of 1-MHz 
bandwidth, at the Nyquist rate. Each sample was quantized to 8 bits. 
One frame of each sequence is shown in Fig. 7. 

One scene, called Judy, is a head-and-shoulders view of a person 
engaged in active conversation. The second scene, John and Mike, 
shows two people entering the camera field of view and walking briskly 
around each other. The third sequence, Mike and Nadine, is a panned 
view of two people always in view of the camera. 

Even though the quantizer characteristic of a Dpcm coder should be 
designed according to the prediction scheme, for simplification in these 
investigations, the same 35-level quantizer shown below was used for 
all simulations. The quantizer has the following positive representative 
levels: 0, 5, 12, 19, 28, 37, 46, 57, 68, 79, 90, 103, 116, 129, 142, 155, 168, 
181. This quantizer was chosen since it gave good picture quality, 
although the quantization error was visible in specific picture areas 
under short viewing distance. The decision levels are always in the 
middle between two succeeding levels. The performance of the predic- 





Fig. 7a One frame out of each sequence—Scene Judy. 
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7c One frame out of each sequence—Scene Mike and Nad 
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tion schemes was evaluated by computing the pel entropy, the entropy 
of a horizontal run length code, and the variance of the quantized 
prediction error. 

For comparison of adaptive and nonadaptive schemes, results for 
four nonadaptive predictors were obtained. The nonadaptive predic- 
tion schemes which were used are given below. 


§ = $50 (38) 
§ = 8 — sh + sho (39) 
aR ae peg e ghee oy fg ar (40) 
2 gt Pig 4 3 4 20 4 21 4 22 4 23 

A 7 , 4) , 6 , 

camer ce g 82 + 2 83. (41) 


The first predictor (38) is simple previous frame prediction. The 
prediction scheme given by (39) is frequently proposed for interframe 
coding.”” The predictor (40) is a three-dimensional predictor proposed 
by Klie’® for moving areas of a picture. Equation (41) describes an 
intraframe predictor which minimizes the variance of the prediction 
error.” 

The results of the nonadaptive predictors are shown in the upper 
part of Tables Ia, b, and c. These investigations show that previous 
frame prediction (38) is advantageous for sequences with not much 
motion (Judy), while the intraframe predictor (41) and the predictor 
(40) are better for sequences with rapidly moving objects (Mike and 
Nadine). An additional decrease in entropy can be obtained by using 
the horizontal run length coding scheme. This gain is especially high 
(16 percent) for the sequence Judy where ZR and NZR are better 
grouped. 


Table la—Entropy per pel and variance of the prediction error for 
nonadaptive and adaptive predictors—Scene Judy. 
Entropy in Bit Per 


Pel 
SS Variance 
Ape. Arun E[e?] Prediction Scheme 
1.035 0.875 16.6 Previous frame, eq. (38) 
1.120 0.953 8.5 2-D Interframe, eq. (39) 
1.349 1.297 9.1 3-D Interframe, eq. (40) 
1.840 1.760 31.7 2-D Intraframe, eq. (41) 
0.838 0.765 5.3 Predictor selection, eq. (11), (12), Wa 
0.781 0.718 4.8 Predictor selection with soft switch eq. (18), 
(20), Wa 
0.783 0.730 4.9 Gradient algorithm, eq. (34), Wa 
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Table Ib——Entropy per pel and variance of the prediction error for 
nonadaptive and adaptive prediction—Scene John and Mike. 


Entropy in Bit Per 
Pel 
a as Variance 
Hoegr Arun Efe?] Prediction Scheme 
2.393 2.190 142.1 Previous frame, eq. (38) 
2.400 2.286 114.1 2-D Interframe, eq. (39) 
2.154 2.094 61.5 3-D Interframe, eq. (40) 
2.397 2.323 88.9 2-D Intraframe, eq. (41) 
1.795 1.711 39.6 Predictor selection, eq. (11), (12), Wa 
1.774 1.687 36.7 Predictor selection with soft switch eq. (18), 
(20), Wa 
1.724 1.629 34.2 Gradient algorithm, eq. (34), Wa 


Table lc—Entropy per pel and variance of the prediction error for 
nonadaptive and adaptive predictors—Scene Mike and Nadine. 


Entropy in Bit Per 
Pel 
re Variance 
Apen Arun Efe?] Prediction Scheme 
2.859 2.809 194.9 Previous frame, eq. (38) 
3.008 2.982 250.0 2-D Interframe, eq. (39) 
2.537 2.504 108.0 3-D Interframe, eq. (40) 
2.546 2.506 117.1 2-D Intraframe, eq. (41) 
2.385 2.353 87.4 Predictor selection, eq. (11), (12), Wa 
2.370 2.336 80.8 ene selection with soft switch eq. (18), 
20), Wa 
2.325 2.284 77.2 Gradient algorithm, eq. (34), Wa 


Adaptive prediction schemes as given in Section II were simulated 
with (38) and (41) as predictor functions. The average bit rate per pel 
for three schemes are shown in the lower part of Tables Ia, b, and c. 
The adaptive schemes give an additional decrease in entropy if the 
horizontal run length coding technique is used; this improvement 
depends upon the type of picture. 

Compared to the case of simple previous frame prediction, the 
predictor selection in combination with horizontal run length coding 
results in reductions of 18 to 29 percent. The corresponding reductions 
for the more sophisticated gradient method are 20 to 32 percent. The 
minimum and maximum entropy of a single frame within a sequence 
are reduced by about the same amount as the average entropy of the 
sequence. This can be recognized for the gradient method in Fig. 8, 
which shows the entropy per pel of each frame versus frame number. 

In Section II, several modifications of the basic methods, to obtain 
a simpler hardware implementation, were presented. Most of these 
modifications have only a small influence on the entropy. The basic 
predictor selection scheme requires the summations of 8-bit numbers 
for determination of the decision functions (12). A coarse four-level 
quantizer 
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Fig. 8-——Plots of entropy per pel versus frame number for each sequence. Configuration 
one shows the pel entropy Hpe of previous frame prediction; two shows the horizontal 
run length entropy Hrun of previous frame prediction; and three shows the horizontal 
run length entropy Hrun of the gradient algorithm (33) with the constraint (26). (a) 
Scene Judy. (b) Scene John and Mike. (c) Scene Mike and Nadine. 
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0 O<|xl< 6 
—jJ1l 685 |x| < 18 
QI) = 49 18 <|x| < 36 (42) 
4 36<|x| 


for determination of the decision function (16) increases the entropy 
by about 1 percent. 

The use of a binary variable v;,, equation (18), which indicates which 
predictor function is advantageous at the position k, in combination 
with the soft-switch algorithm of equation (20) is to be preferred. 
Compared to the predictor selection scheme (11), (12), this algorithm 
provides a reduction of up to 7 percent in entropy. In addition, it is 
easier to implement. 

For the gradient method the algorithm (34) which incorporates 
several modifications of the original method is useful concerning cost 
of implementation and the reduction in entropy. The constraint (26) 
is especially advantageous. For the algorithm (34), a three-level quan- 
tizer with thresholds at +4 was used. The optimum value of y was 
found to be 1/4. Each line started with initial values b,; = 1/2 and 
b2 = 1/2 for b. As long as the weighting coefficients b; are represented 
by more than 4 bits, the gradient method provides a small gain in 
entropy compared to the predictor selection schemes. 

In these investigations, three windows W,, Wz, and W, were used. 
The window W, provides results very close to that of W., whereas W, 
provides an increase of about 2 percent in entropy. 

Further, it was found that using three predictor functions (the 
intraframe predictor is now split into two functions, one for horizontal 
prediction and one for vertical prediction) is not better. Besides the 
intraframe predictor function (39), the predictor function 


1 1 
h=5sitsss (43) 


was also used. This resulted in an increase of 4 to 5 percent in the 
entropy. | 

It is of interest to know how these adaptive schemes perform in 
comparison with conditional replenishment and displacement compen- 
sation schemes. The results published in Ref. 6 (Table I, page 1235) 
based on the same source data are of some interest in this context. 
Hence, a comparison is possible, but it should be noted that the 35- 
level quantizer used in this investigation is a modification of the one 
used in Refs. 6 and 8. Further in this investigation, an additional 
thresholding of prediction error is not performed. 

Compared to conditional replenishment the adaptive schemes pro- 
vide a reduction in entropy of 19 to 38 percent, depending upon the 
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scene. For active scenes like John and Mike and Mike and Nadine, the 
adaptive schemes provide a data rate close to that of displacement 
compensation (within +5 percent range). The run length coding 
scheme provides an additional reduction in entropy for sequences with 
low activity. For the sequence Judy, this reduction is 26 percent 
compared to conditional replenishment in case of previous frame 
prediction in combination with run length coding. 


V. CONCLUSION 


The performance of two types of adaptive intra-interframe predic- 
tors in combination with horizontal run length coding was studied. 
The gain in entropy of the predictor selection scheme is nearly as high 
as that of an adaptive scheme which is based on a gradient technique. 
Various modifications of the two basic methods which were investi- 
gated provided only small changes in entropy. Therefore, the adaptive 
algorithm which has the lowest cost of implementation should be 
chosen. 

Further investigations are necessary for the quantizer design and 
the buffer control in a fixed rate system. A combination of the described 
adaptive intra-interframe algorithms with motion compensation will 
result in a more sophisticated system which provides further entropy 
reduction. 
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Human Performance Engineering 
Considerations for Very Large Computer- 
Based Systems: The End User 


By |. S. YAVELBERG 
(Manuscript received May 15, 1981) 


Effective Human Performance Engineering for a large-scale, com- 
puter-based system involves many complex strategic and tactical 
decisions regarding the computer system design, the target user’s 
behavior, and the organization/environment. Descriptions of the 
more important performance considerations are presented. These are 
based primarily on the experience accrued during the last several 
years in the building of Bell Laboratories centrally developed com- 
puter systems for use by telephone company loop operations personnel 
in assisting them to do their job. The target population can be broken 
down into these distinct classes: End Users, Database Maintainers, 
and the Data System Support Staff. This paper focuses on the End 
User, and specifically the Bell System Service Representative. Impor- 
tant points include: (i) early emphasis of human performance consid- 
erations in the computer system design process can reap valuable 
benefits; (ti) care must be taken to specify input/output design features 
which have gone through the human/system engineering step of 
identifying a favorable payoff versus penalty ratio; and (ili) based on 
measurement data and user interrogation, computer system availa- 
bility and transaction failures and response times can seriously 
damage user performance and system acceptance. 


l. INTRODUCTION 


Effective Human Performance Engineering (HPE) for a large-scale 
computer-based system involves many complex strategic and tactical 
decisions with regard to the computer system design, the target user’s 
behavior, and the organization/environment. Descriptions of the more 
important performance considerations are presented. These are based 
primarily on the experience accrued during the last several years in 


165 


the building of Bell Laboratories centrally developed computer systems 
for use by telephone company loop operations personnel in assisting 
them to perform their job. Wherever possible, observations are com- 
pared to recent literature associated with other systems or experi- 
ments. 

The major in-place elements that affect human performance are the 
organization/environment and the behavioral considerations of the 
workers themselves. The introduction of a computerized support sys- 
tem adds another element. Presumably, the computer improves the 
effectiveness of the worker in performing the job. It also provides some 
design flexibility to counter some of the negative aspects of the other 
three existing elements. However, the computer system itself intro- 
duces serious behavioral effects on its users that may negate its 
benefits. 

The target population can be broken down into these distinct classes: 
end users, database maintainers, and data system support staff. This 
paper focuses on the end user with special emphasis on the Bell System 
Service Representative. Principal points made are as follows: 

e An understanding of the target population’s attitude and behavior 

toward computers is necessary. 

e Early emphasis of human performance considerations in the com- 
puter system design process (workflows) can reap valuable bene- 
fits. Unworkable or unreasonable performance requirements on 
the user can be substantially avoided. 

¢ Care must be taken to specify input/output (1/0) design features 
which have gone through the human/system engineering step of 
identifying a favorable payoff versus penalty ratio. 

e The introduction of a computer to the existing organization/en- 

vironment should stimulate some adjustments (job assignments) 
to optimize the mechanized features. Every attempt should be 
made to do this as early as possible. However, if workers in one 
department are asked to increase (or even change) their workload 
to make it easy to program a computer helping another depart- 
ment, there is extensive resistance. 
Operational—more so than functional—characteristics of the com- 
puter system (availability, transaction response time, transaction 
failures) can have a serious, and often underestimated, negative 
effect on user performance. For example, many sporadic short 
outages can have a significantly greater effect on user performance 
than a single, extended outage. 


ll. MAJOR CATEGORIES OF HUMAN PERFORMANCE CONSIDERATIONS 


The model that represents the relevant physical and behavioral 
elements and interrelationships has, as the central element, the collec- 
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tion of people whose performance is what this paper is all about: the 
target population or the end user of the computer system. Surrounding 
the user are three major categories, each of which introduces human 
performance considerations. The computer system is related to the 
user by a two-way interface which represents physical, as well as 
behavioral, interactions. The two in-place categories represent organ- 
izational/environmental considerations and the behavioral considera- 
tions of the user. These two categories have unidirectional interfaces 
representing behavioral influences only. 

In contrast to other models in the field, I chose to group organization 
and environment together and add the design and constraints of the 
mechanized elements (main frame, communications, terminals) under 
the umbrella category, computer system. I do not advocate waiting for 
the performance deficiencies to occur before addressing solutions. We 
can predict potential performance problems and avoid them. This 
important philosophy is emphasized throughout this paper. Most of 
the material concerns the human/computer interface: in a positive 
sense, where interface features relieve user performance problems 
associated with the precomputer job and, in a negative sense, where 
operational characteristics of the interface itself introduce new per- 
formance difficulties. 


li. TARGET POPULATION 


Without question, knowing the target population is fundamental in 
solving any human performance problems. In particular, it cannot be 
overstressed how necessary an understanding of the target population’s 
behavior towards computers is with respect to effective performance. 


3.71 Population classes 


For many large computer-based systems, the target population really 
changes with delivery. In the predelivery (usually manual) environ- 
ment there are generally two classes of users: customer service end 
users and back-room record maintainers. With the introduction of the 
computer system, three classes, generally distinguished by their level 
of expertise with computers,’ emerge: the unskilled, the semiskilled, 
and the skilled. In a Bell operating company (Boc) environment, the 
jobs associated with these levels are often end users and their super- 
vision at the unskilled level; database maintenance clerks and their 
supervision at the semiskilled level; and data systems support staff at 
the skilled level. It is important to remember that skill here applies 
specifically to computer use and not to the whole job. I prefer to 
identify these classes as end users, maintenance clerks, and operating 
staff. 

Within this target population framework, human/computer inter- 
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face design should be multifaceted, and groomed specifically for each 
class. 


3.2 End user: the Service Representative 


This paper deals with the end user. Most of the field experience, 
obtained between 1978 and the present, is associated with the Bell 
System Service Representative (Service Rep). The computer system 
for which the considerations are based is Premises Information System 
(PREMIS), which is a Bell Laboratories system designed to provide 
BOCs with customer-related information to help determine service 
order information. 

PREMIS supports the Residence Service Center Rep on-line during 
new customer negotiation and also provides some important data that 
must go on the service order. When a customer calls the Boc to place 
an order for new residential service, there are a number of tasks that 
must be performed by the Boc Service Rep taking the order: 

(t) Get the new address correctly. 
(iz) Sell telephone service at the new address. 

(iit) Establish the credit class for the customer to determine if a 
deposit 1s required. 

(tv) Quote charges and rates. 

(v) Give the date service will begin. 

(ut) Assign a telephone number. 

The Service Rep records all of this information on a service order, 
which will be used by other departments in the BOoc to provide service. 
PREMIS aids in all of these tasks and replaces the paper records, 
microfiche, telephone calls, or guesswork used previously. The physical 
architecture consists of a very large UNIVAC computer, a communi- 
cations network built around BANCS, another Bell Laboratories prod- 
uct, and 40/4 terminals located at each Service Rep’s position. The 
computer is administered by a data systems group, and the database 
1s maintained by several dispersed maintenance groups. Primarily, 
PREMIS is an address-keyable system providing the needed address- 
related data. The Service Rep simply keys the new address into a 
preformatted mask on the terminal. Usually all that is needed is the 
house number and street name. 

PREMIS responds with information about the geographic area that is 
needed on the service order. This includes wire center, exchange, rate 
zone, tax area, directory group, and the service features available for 
the address. This same display will also include the existing customer’s 
name, telephone number, and presence of an in-place connected circuit 
or loop from the address back to the central office. 

As of December 1980, PREMIS served almost two million residences, 
accounting for almost half of South Central Bell Telephone Company’s 
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residence service orders. About 1000 Service Reps are now accessing 
the system. 


3.3 Behavioral considerations 


Human behavioral factors that are particularly relevant to the 
introduction of computer support have been widely recognized: short- 
term memory limits, the need for closure, the desire for control, 
motivational characteristics, and the fear of computers themselves. 
Computer system designers—especially those responsible for the hu- 
man/machine interface—can effectively incorporate design features 
which protect against the negative behavioral effects of these factors. 
This paper will cover many of these features. Perhaps, the anxiety or 
fear of computers is the most underrated and is discussed next. 

Extensive research, experimentation, and observations have been 
noted by Shneiderman showing that user attitudes—fear of com- 
puters—can have a major effect on learning and performance.’ His 
data survey suggests that “Novices with negative attitudes towards 
computers learned editing tasks more slowly and made more errors. 
Anxiety, generated by fear of failure, may reduce short-term memory 
capacity and inhibit performance.” In a later paper,’ Shneiderman 
notes that part of the training requirements must be to overcome a 
distorted role of a computer as perceived by the potential user. The 
media and computer manufacturers have unfortunately characterized 
computer capabilities by life-like behavior which results in potential 
trainee resistance in the form of increased apprehension, resistance to 
technology, and anxiety interfering with learning potential. Shneider- 
man cautions against predictions such as “people and machines are so 
similar that with a few years effort, they should be able to produce 
machines that are superior to people.” He states that “this naive view 
is useless as a goal and harmful in destroying people’s expectations of 
themselves and how they will use computers.” Experience with new 
users of PREMIS confirms this apprehensive attitude generally among 
the most experienced, entrenched, settled-in population. However, the 
younger, newer employees often showed enthusiasm and genuine 
wonder at how the computer can help them do their job. In fact, they 
tended to press for an education into “how the computer works inside,” 
rather than limit their involvement to the minimal training on trans- 
action I/O. 

The negative attitude towards mechanization has sometimes ex- 
tended in serious directions. While I have never encountered sabotage, 
evidence of disgruntled employees damaging a delivered computer 
system exists,* apparently out of frustration at not being properly 
trained. One case resulted in a prison term for consciously sabotaging 
a system dozens of times during a period of 18 months. 
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Jones emphasizes the need for building up the confidence of a 
potential user by incorporating into the design of the computer system 
features which give the user the feeling that: “his or her commands 
will be obeyed; the data are in safe hands; a good and thorough job is 
being done; and the machine is going to help the user.” 

The desire for control can be best satisfied if the user is the initiator 
of all human/computer interface sessions. This principle is easy to 
support if the end user is getting “information retrieval only” support 
from the computer, as in the case of the Service Rep. For data system 
support positions where the computer initiates the need for work, this 
is, of course, not possible. 

The psychological needs associated with short-term memory limits 
and the need for closure (the completion of a task leading to relief), 
are interrelated. Excesses can lead to delays, forgotten items, and 
increased error rate. The key computer-related factors affecting these 
needs are the careful functional design of human/computer interface 
sessions into discrete subtasks and the transaction response time 
delays and variations for a variety of operational conditions. Both of 
these factors will be examined in this paper. The important point, 
made by Miller, is that “a psychological closure permits at least a 
partial purging of short-term memory; waiting time invokes stress.” It 
is believed that more extended delays (as in computer response time) 
“can be tolerated just after closure rather than in the process of 
obtaining closure.”® The idea is to have a system design which does 
not lead to loss of user concentration, that is, maintain continuity of 
human thought processes. 

Finally, certain characteristics associated with the design and deliv- 
ery of a computer system can seriously damage the motivation of the 
user. Three such characteristics which have proven important in 
PREMIS were generally identified in an earlier paper by Hackman:’ 
experienced responsibility for work outcomes; experienced meaning- 
fulness of the work; and knowledge of results. These motivational 
needs, as well as the other psychological considerations discussed 
above, will be related to PREMIS field experience discussed later in this 


paper. 


IV. COMPUTER SYSTEM CONSIDERATIONS 


This category is a catch-all for mechanized system elements, hard- 
ware, and software. It also includes the physical attributes associated 
with the human/machine interface design. Discussed below are ex- 
amples of design, development, and field experience in this category. 

Major elements of the physical architecture of PREMIS are as fol- 
lows: 

e Computer—UNIVAC 1100 series. 


770 =THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


e Communications Network—BTL/South Central Bell BANCS 
THP System resident on Control Data Corporation (CDC) Cyber 
1000; Connectivity via 50-kb high-speed lines. 

° 'T’erminals—Dataspeed Mod 40/4 series; Input device—QWERTY 
keyboard with function keys; English language; output device— 
CRT display. 

At a lower level, many operational and functional system consider- 
ations remain which can fundamentally affect user performance. Be- 
fore developing some of these specific issues, I offer some recommen- 
dations on the staffing and timing of the system design activity. 


4.1 Early HPE influence in system design 

I believe human/machine interface issues can be best understood 
from a user perspective by professionals in the broad HPE field. Early 
analysis by HPE professionals can help provide a sound basis for system 
design decisions. By working with hardware and software professionals, 
they can substantially avoid unworkable or unreasonable performance 
requirements on the user. Bennett agrees, noting that human/machine 
interface design “will be most expeditiously advanced if those already 
trained in human engineering join the design team rather than if 
software people attempt to learn engineering.”® This position is not 
universally accepted, but certainly trending in this direction. Bennett 
goes on to paraphrase McCarn: “... experts in computers see well- 
bounded problems with great precision” but have difficulty in accept- 
ing the idea “that human communication is much more complex, and 
its context more extensive, than was initially conceived by computer 
programming staffs.” I believe this may have been generally true 
several years ago, but lately, with extensive teamwork among HPEs 
and computer science types under our belt, it is not unusual for a 
genuine interest and insight of the human side to evolve in software 
professionals. 

Regarding the detailed human/machine interface design, one key to 
ultimate high user performance is simplicity in learning. Shneiderman 
stresses this by noting that simplicity of design can best meet the 
bottom-line economic objectives of the system (e.g., computer proc- 
essing efficiency, storage capacity, communication network load) and 
still generate the desired user performance in terms of reliability, 
reduced error frequency, and enhanced satisfaction. Closure, which is 
strongly influenced by short-term memory considerations, must also 
be considered. The user receives “great relief when information is no 
longer needed to be retained. There is ... a powerful desire to complete 
a task, reduce memory load, gain relief.” In terms of operational design, 
transaction response time requirements obviously are tightly coupled 
with this behavioral consideration. From a functional design view, a 
direct design guideline is to formulate transactions which allow the 


HUMAN PERFORMANCE ENGINEERING 771 


same user to complete each task in sequence. These issues are all part 
of a fundamental principle: A key to influencing ultimate positive 
acceptability of a large computer system is the initial approach taken 
in system definition. I am a believer in formally going through these 
steps: 

1. Functional decomposition, analysis, and allocation—This, in ef- 
fect, divides up the tasks between humans and machines, making the 
best use of the strengths of each. 

2. Work flows—This shows how the new job will get done in a 
logical sequence. 

3. Human/machine interface design—Finally, from the work flows, 
the interface points can be supported by detailed 1/o design. 

Martin refers to these three steps as “functional,” “procedural,” and 
“syntactical.” If Step 1 is not done, the ball game may be lost before 
it starts. Jordan recognized this many years ago: “Men are flexible but 
cannot be depended upon to perform in a constant manner, whereas 
machines can be depended upon to perform consistently but have no 
flexibility whatsoever.” Martin, more recently, says it best: “The 
difference in “thinking” talent—the computer being good for ultrafast 
sequential logic and the human being capable of slow but highly 
associative thinking—is the basis for cooperation between man (hu- 
man) and machine. It is because the capabilities of man and machine 
are so different that the computer has such potential ... It is important 
that system designers ... do not try to make the computer compete 
with man in areas in which man is superior.” 

A very important system design principle which is being used to 
great advantage in PREMIS is the concept of modular structure. Each 
separate system function is architecturally built independent of the 
others. This applies to internals (processing logic and database struc- 
ture) and externals (transaction groupings). In addition to the obvious 
benefits of understandability and simplicity, this modular structure 
allows an upward compatibility in later releases. It also allows the 
trialing or “soaking” of a new feature in a limited area before general 
deployment. In a similar way, by working with the target organization 
staff, the existing target organization framework (e.g., methods, work 
flows, job, measurements, etc.) can be examined. User performance 
deficiencies can then be avoided by influencing the change of the 
organizational structure when mechanized support is introduced. This 
is covered later in this paper. 


4.2 Input/output design considerations 
4.2.1 Interactive transaction characteristics 


PREMISs has incorporated a variety of Service Rep 1/o design features 
which help smooth the interaction from a human point of view. Many 
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features were part of the original design; others were designed and 
delivered via field reviews and feedback from the trial users—South 
Central Bell. A few are in the process of being delivered. Listed below 
are those believed to influence user performance and satisfaction. 
Some specific examples can be found in other papers by Hicks’ and 
Ferrer.” | 

(1) Input Mode—Inputs can be classified as either ‘‘coded”’ or 
“prompted.” With coded input, the user supplies the labels and the 
data values; with prompted input, only the data values need be entered 
because the labels are supplied by the computer via form-filling (a 
mask). The latter is recommended for the end user because of its 
inherent advantages in reducing input errors and work time, while 
increasing satisfaction. Prompted input can be further divided into 
“interactive” or “batch.” With interactive prompted, the computer 
waits for the user to enter a data value after sending each prompt; 
with batch prompted, the user fills out the entire mask and the 
computer gets only one transmission. For a computer system without 
local intelligence at the point of entry, the batch-prompted mode is 
much more efficient from a communication system overhead and 
computer usage capacity point of view and was selected for PREMIS. 

(tz) Command Structure—Short, simple, consistent structure em- 
phasizing clarity. 

(tit) Single Display Frame (Input)—Transaction inputs always 
limited to single screen (mask); no input paging required. The partic- 
ularly awkward paging design associated with the MOD 40/4-system 
software combination made this very important. 

(tv) Single Display Frames (Output)—Transaction outputs usu- 
ally limited to single screen (mask); however, in some cases, in 
“prompting” mode, multiple output pages can be returned to the user. 
Because paging commands require full transmission back to the 
computer, the transaction response and human search time delays are 
proving impractical for more than a very few pages. A revised design 
includes providing more limited output via either more selective data 
returned or asking the user for more input data to narrow the scope of 
the stored database information. The concept of putting the most 
likely choices on the first output page has obvious benefit, but as of 
yet, such an algorithm has not yet been implemented for PREMIS. This 
issue 1s covered in more detail later. 

(v) Discrete Transaction Per Function Design—Several inde- 
pendent subfunctions requiring computer assist each has their own 
independent transactions. This is in support of the user’s need for 
closure. In addition, this allows optimization of transaction queue 
control to maximize the priority of the end-user transactions. 

(vt) Transaction Sequencing—While transactions are discrete, 
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each output screen is designed so that over-typing a portion of the 
input control field, leaving the desired data, adding new data and 
hitting the SEND key can request the next transaction. This achieves 
a high level of “Reusability”’’ of former inputs and outputs. 

(vit) Cursor Movement—Nondestructive forward and backward 
position-by-position movement and programmable tabbing for auto- 
matic movement to the beginning of next and previous input fields. 

(viit) Prompting—Software logic which detects some shortage of 
input information and requests the user to provide additional input. 

(zx) Menu Selection—A type of prompting where a limited set of 
valid responses is presented on the screen and the desired response 
can be chosen by keying in the choice, the number of the choice, or by 
positioning the cursor next to the choice. 

(x) Parameter Defaults—For each community of interest (e.g., 
city or state), a set of table-driven parameter values which represent 
an agreement by the user on what are normal, or the most often used, 
values that can be assumed if the field is left blank. 

(ct) Minimal Input—The user need enter only a “shorthand” 
input, and the computer employs some pattern recognition logic to 
fully interpret the input. (This feature has sensitive system perform- 
ance implications and is discussed later.) 

(xit) User Control—The user initiates and controls all human/ 
computer interactions. 

(xtit) User Messages for Transaction Failures—Currently, the 
screen goes blank for certain transaction system failures and uninform- 
ative messages are returned for others. This is a serious cause of user 
discomfort. Meaningful messages should be returned in every case, 
and efforts are underway to make this possible. 


4.2.2 Multisystem considerations 


(i) System-to-System Switching—Currently, the Service Rep 
has access to another mechanized support system in addition to PREMIS 
via the same terminal. During the same work session, one switch (or 
more) between systems is often necessary. The current log-off-log-on 
procedure involves several keystrokes and response waits. Serious user 
dissatisfaction with the procedure has led to the recommendation of a 
quick-switch procedure of perhaps just one function key. In addition, 
the entry mask for the appropriate system being logged on to should 
appear automatically on the screen. 

(ii) System-to-System Consistency—There exists system-specific 
function keys and dialogue mneumonics for each of the systems 
accessed by the same user. Lack of system-to-system consistency is a 
training problem and a potential source of error. Multisystem agree- 
ment on design is currently being negotiated. 
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4.2.3 Screen display physical characteristics 


Most terminals in the field today generally provide acceptable 
physical characteristics from a human performance point of view. In 
the case of the Dataspeed MOD 40/4 used for PREMIS, the users 
reacted favorably when asked about screen brightness and contrast 
and character size, sharpness, and spacing. The only negative reaction 
to the screen was the eye strain associated with users whose terminal 
screen faced a nearby window where the sun glare was strong. This 
was easily remedied by drapes or blinds. 


4.2.4 Information display characteristics 


In terms of information display, the PREMIS design was open to 
human performance considerations. A variety of human-engineered 
screen information display principles were followed in the PREMIS 
design.*° These design features were incorporated to improve the 
cognitive behavioral response of the user, mostly by minimizing data 
searching, Increasing awareness, and by increasing recognition and 
distinction. PREMIS users gave very high values on the ease of use of 
this information display (8 to 10 on a scale of 1 to 10, 10 being the 
highest). However, no experimentation has yet been done on PREMIS 
to determine sensitivities to varying display techniques. 

(t) Positioning of Input Data on Screen—As close as possible, 
input data should be entered in a left-to-right and up-to-down sequence 
consistent with the work operations. 

(tz) Input Data Preservation—AlIl input data appears on the out- 
put screen completely and precisely in the same position. 

(wiz) Positioning of Output Data on Screen—All data displayed on 
an output screen (mask) always appears in the same spatial position 
in multiple operations to minimize user search time, “reduce disruptive 
movement and help highlight the impact of the last operation.” “ 

(tv) Highhghting—For key output parameters, where particularly 
high detectability is desired, highlighting is used. 

(v) Data Labeling—AlIl variables on 1/0 screens appear as pair- 
wise identifiers and values. 

(vi) Number Displays—Long number sequences in the same field 
are broken down into subsets. 

(vit) Screen Partitioning—Separate, dedicated areas for input, nor- 
mal output, and user messages. 

(vit) User Message Semantics—All user error control and/or in- 
structional messages are constructive and supportive, not condemning 
and confrontive.® © 

(tx) User Message Syntax—aAll user error control and/or instruc- 
tional messages are in English (no cryptics/codes). 
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4.2.5 System engineering considerations 


While ensuring understandable, simple human/computer interface 
features, the designer must also consider the effect of these features 
on the hardware/software capacity of the system. Gilb notes, “Exten- 
sive use of humanized input designs can easily result in substantially 
greater consumption of central processing cycles and secondary storage 
search time over programs which effectively place the equivalent work 
processes on human beings.”* Care must be taken to specify 1/o design 
features which have gone through the human/system engineering step 
of identifying a favorable payoff versus penalty ratio. This important 
design activity 1s not always done in computer systems and, for some 
features, in the PREMIS project also, subsequent field analysis has 
sometimes suggested the wrong emphasis. 

For example, the misuse of prompting and menu-select features 
associated with interactive transactions can be a potential source of 
unexpected (and undesired) induced load. The additional transactions 
generated from these features can significantly degrade the system 
load capability. That is, the payoff from reduced training costs, reduced 
keystrokes, and reduced entry errors may not offset the economic 
penalty in increased transaction load. A recent example of this is the 
minimal input feature in PREMISs.’””® 

This feature allows the Service Rep to key into the computer as few 
as the first four characters of a street name. The computer searches 
the database for a match and, if found, returns the desired information. 
If more than one internal match is found, a menu is returned to the 
terminal screen and the user, after perusing the menu, selects his or 
her choice and resends the transaction. Training was provided on how 
to use this feature but not when to use it. Analysis of audit tapes 
showed that, for some locations, 50 percent of the time the menu- 
select response was happening. Not only was this causing an unac- 
ceptable additional load on the system, but, the user was paying a 
penalty in increased time per event. This is evident from Fig. 1. Figure 
1a shows that if a second transaction per address is required more than 
30 percent of the time, it takes more time on the average to use 
minimal input than to use full spelling. Figure 1b shows the rate at 
which transaction volume increases for additional transactions per 
address beyond the 15 percent error rate assumed for full spelling. 

The potential user and system costs of what was intended as a 
humanized input design was uncovered in an operational review of the 
system. However, this same review found considerable variability 
between locations on the percent of time that minimal input required 
additional transactions. These findings suggested that rather than 
remove the feature altogether, the objective should be its efficient 
utilization. Work was undertaken to develop the software analysis and 
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Fig. 1—-Impact of minimal input. (a) Service Rep time. (b) Transaction volume. 
(More than one transaction is required to get a match.) 


monitoring tools which could be applied per location, so that usage 
guidelines per location could be developed. Until these software tools 
are available, however, the potential cost was deemed too great and, 
as a short-term remedy, the recommendation was to stop using mini- 
mal input. 

Another issue which was not properly engineered in the initial 
delivery of PREMIS was establishing and enforcing limits on the amount 
of computer processing or output pages which any individual Service 
Rep transaction could consume. Assumptions were made about real 
world data which proved not to be true. As a result, information 
retrieval transactions were allowed which searched vast areas of the 
database for a “hit.’’ After minutes of chewing up processing time, 
having failed to find a certain hit, a very large number of possible 
menu-select choices are output back to the Rep’s terminal requiring 
many pages to go through. In an environment where the Rep is 
negotiating with a customer on the telephone, these occurrences re- 
sulted in Rep frustration leading to a bypass of PREMIS altogether. A 
careful review of these cases resulted in changes in the software logic 
and controls to include firm processing and output upper bounds for 
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certain transactions, while providing user messages and enough infor- 
mation to proceed through the customer contact. For example, it is 
often possible for the user to change the input in such a way that the 
search is narrowed and the output limited. (To illustrate, if a Rep is 
dealing with a customer moving into an apartment complex with many 
units and the units are identified by a combination of letter and 
number, say Apartment A-162, the customer would often state their 
address as 162-A. Rather than output all apartment numbers as 
possible matches, a quick perusal of a single output page can quickly 
identify this inconsistency which can be easily handled by overtyping 
the reversed characters and resending the transaction. 

Unfortunately, the software processing safeguards were not applied 
to all potential overload situations. Providing all possible database 
matches on the first four characters of the keyed-in street name caused 
the entire system to go down several times in the first few weeks of 
1981. These occurrences were all associated with a geographic area 
which happened to have many highways. By matching HIGH, all 
highways were attempted to be sent back to the user. However, the 
software output buffer could only handle 200 of these in one transac- 
tion, and this number was exceeded. The software had no programmed 
safeguards against this overload, and it caused the whole system to 
lock up. To make matters worse, no output message identifying the 
specific transaction or the condition was sent to either the user or the 
computer center. To relieve this condition, the software processing 
was changed to limit potential matches to 50 and, if more than 50 
occur, send none back to the user; instead, provide a new user output 
message stating “NO EXACT MATCH ON ADDRESS ENTRY— 
TOO MANY SIMILAR ADDRESSES TO DISPLAY.” 

In the same vein, other obvious user-oriented 1/o features can cause 
substantial induced load on the system. Two examples which should 
not be incorporated into the design without careful analysis are (z) 
allowing the end user access to a status transaction which provides 
what state a previously entered transaction is in, and (iz) allowing the 
end user to enter later transactions without first receiving a response 
from the first. 

Both of these capabilities tend to further degrade an already over- 
loaded condition. 


4.2.6 Novice versus experienced variations 


Several papers in the field today suggest that there be two varieties 
of 1/0 design: one for the brand new end user, as part of a training 
strategy, and another for the expert user. The novice version would 
have very simplistic inputs and expanded outputs associated with 
prompting, menus, and full-error messages; the expert version would 
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be optimized for minimal load on the system, that is, flexible inputs 
and outputs, but designed to require only a single human/computer 
interaction per task. The idea is to do a “switch” per terminal or per 
end-user identification when a certain proficiency level is reached. It 
has even been suggested that the computer itself track the progress of 
each user, and via an adaptive switch, change the 1/o design from 
novice to experience accordingly. In PREMIS, only one 1/o design exists, 
but for the new user a Training Subsystem is provided which allows 
training transactions to be tracked and processed independently from 
live transactions using a training database. Further field analysis is 
necessary to determine the cost effectiveness of these various ap- 
proaches. 


4.2.7 User errors 


No matter how hard system designers try to make transaction inputs 
simple, user errors will always be somewhat of a disappointing surprise 
when the system is in operation. As part of the system design, there 
should be easy monitoring (mechanized) of error patterns so that some 
action can be taken in response to frequent, costly errors. The system 
modification might be in the semantic or syntactic elements of the 
input or might be solely associated with training. 

A mechanized means of sorting user errors by the Service Rep has 
recently been designed. The statistical information is conveniently 
displayed in report format and is available on request by management 
personnel. This report capability is scheduled to become operational 
in 1981. It is anticipated that feedback to the Service Reps on their 
‘error rates will be a positive influence on their performance and 
attitude. A user error rate objective standard will be established as 
soon as a substantial amount of data is collected and analyzed. This 
standard, which may vary depending on local conditions, should also 
be made known to the Reps. Experimental results’’ have shown that 
in a repetitive human/machine interface task, the combination of 
feedback on both individual performance and an accepted (high) 
standard can have significant positive effect on reducing user errors, 
while uplifting individual morale and uplifting the organizational cli- 
mate. A 15 percent decrease in the average error rate was ultimately 
achieved. In addition, the results of observations and interviews sug- 
gests that the increased pressure on performance did not cause any 
undue physiological or psychological stress. 


4.3 Operational considerations 


In my experience, three computer operational considerations are by 
far the most significant: availability, transaction failures, and transac- 
tion response time. 


HUMAN PERFORMANCE ENGINEERING. 779 


The users should be (but usually are not) prepared for the effect of 
these on their job performance. Our experience has shown that these 
factors have a potentially drastic and traumatic effect on the user, 
especially when they first use the system. They must be prepared for 
these effects. 


4.3.1 Availability 


The availability of interest to the user is the end-to-end variety; that 
is, the probability that all components of the system are functioning at 
the time the user desires access. 

Repeated availability problems caused by equipment outages can 
result in long-term attitude problems. If a terminal, for example, goes 
down often (low mean-time-to-failure), it not only interferes with work 
performance during the outages, but it is also viewed with distrust 
when it becomes workable. This results in the user checking and 
double-checking results constantly, thus, increasing task time. 

Often, however, there are socio-technological obstacles in the way of 
providing realistic availability information to potential users. For ex- 
ample, an on-site review associated with FAcS* uncovered the follow- 
ing:'® 

Overall Application User Opinion: “The system is down on the 
average of 3 hours per day!” 

Operations Staff: “Not so! Availability is in the high 90’s!”’ 

How is this widely different view possible? The system administrator 
is generally measured on how often and for how long the system is 
down. Therefore, it is highly unlikely that availability—as seen at the 
user’s terminal—is either known or, if known, publicized. The tradi- 
tional fallacy is the official reporting of “computer main frame down- 
time during business hours.” There are two serious problems with 
this: 

(tz) As mentioned earlier, care must be taken to deal with end-to- 
end availability—the product of on-line hardware/software/commu- 
nications links reliabilities. That is, the terminal, the transmission 
lines, the message switcher(s), etc., must all be considered to get a true 
picture of user availability. 

(ti) Often, just as much work is done during off hours to keep the 
system up to date. Availability can be just as important here. 

A classic case of a difference between perceived availability (by the 
user) and measured availability (by the operations staff) occurred 
during an on-site operational review of Facs.’® At the user’s terminals, 
on many occasions during the day, transactions could not be entered 


* Facs: Business Information Systems Customer Service/Facility Assignment Sys- 
tem, FACS is a Bell Laboratories-developed mechanized support system, an early version 
of which was trialed in the late 1970s and is now being redesigned. 


780 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


into the computer. On the other hand, the official availability of the 
computer for the same time frame was in the high 90s. After intensive 
investigation, it was found that a major cause of the confusion was 
tape handling. As part of the recovery procedures, audit tapes are 
generated saving a transaction history. These tapes would each handle 
about 40 to 60 minutes worth of activity. After each tape was full, it 
had to be manually removed from the tape drive and a blank tape 
mounted. This takes up to 3 minutes per occurrence. During these 
time intervals, no user transactions could be accepted by the computer; 
however, they were not considered outages in terms of reporting 
availability. 

A very interesting user behavioral effect results from these multiple, 
very short outages. Say, a user attempts to enter a transaction during 
one of these outages. Normally, a “message received” appears on the 
screen; in this case, the screen goes blank. The user mentally records 
that the system is not available, but does not know exactly how long 
this condition has existed. Suppose now, another try is made. Again, 
a blank screen. The user gives up, walks away, and does something 
else. Thirty minutes later, the user walks over and tries again. Nothing. 
Another mental note: ‘““The system’s been down anywhere from a half 
hour to an hour.” Two outages of maybe 3 minutes each resulted in a 
magnified user performance degradation. If nothing else, this phenom- 
enon suggests that many sporadic, short outages can have a signif- 
icantly greater effect on user performance than a single, extended 
outage. For extended outages, telephone calls were generally made 
from the computer center to each user work center informing them of 
the situation thereby limiting the perceived availability problem to the 
actual. (The audit tape problem was subsequently relieved by the 
introduction of a mechanized transfer process.) 

During a PREMIS field visit, the negative impact on the Service Rep 
of many short outages was further magnified. Over a period of 4 hours 
there were at least 6 system outages of 5 to 20 seconds each. The total 
actual time the system was unavailable was less than 1 min. (None of 
these occurrences was officially included in availability statistics be- 
cause the minimum outage reportable was 5 minutes.) However, every 
Rep accessing PREMIS when an outage occurs is also on the phone to 
the customer. There is, of course, no way of knowing when the system 
will again be available. The Rep can either choose to bypass PREMIS 
and resort to back-up procedures (extra time, motion, and chance of 
error) or skip the portion of the customer contact associated with 
PREMIS and hope the system comes back quickly. The latter was 
generally done. However, the Rep must formally log back on to PREMIS 
each time and either call up or clear out of the system any message or 
partial messages that were in process at the time of the outage. This 
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might take a minute or two to accomplish. The cognitive load associ- 
ated with these required actions in concert with verbal continuity of 
customer negotiations created obvious stress on the Reps. They were 
visibly disturbed and often made keystroke errors, as well as wrong 
decisions. Worse, the fact that these outages were not counted (because 
of their short duration) as official availability measurements put no 
pressure on the data systems organization to investigate the problem 
and attempt to resolve it. 

Another important issue is the fallacy of partial availability. It has 
been my experience that the hardware/software portion of the system 
does not “fail softly” as Martin puts it. No matter what mechanized 
fall-back processes are built in, the user is usually dramatically af- 
fected. What is needed mostly, I believe, are not gradual, stepwise 
degraded user procedures, but a simple alternate design that bypasses 
the computer altogether. The user must be able to conduct business 
for periods of time when the computer is totally down or not available 
anyway. 

Simply waiting for the system to come back up is not acceptable in 
dealing in real-time tasks with a third party (the customer) such as 
the Service Rep’s job. In effect, the fall-back position closely resembles 
the procedures and aids which were in the pre-PREMIS environment. 
After these procedures are designed and tested, they should be in- 
cluded as part of the formal training. In PREMIS, the use of periodic 
printouts (albeit somewhat out of date) keeps the business going 
reasonably well. 


4.3.2 Transaction failures 


Transaction failures are defined to be cases of user-initiated trans- 
actions which do not process to successful conclusion. The causes of 
these failures can be hardware-related or software-related. A high rate 
of these failures, especially if combined with poor user notification, can 
seriously affect user performance. An example of such a situation was 
observed with Fracs. Table I shows representative failure statistics.” 

The causes of these failures were spread among management soft- 


Table |—Failure data 


Failure 
Total Rate 
Transac- Total (Per- 
Month tions Failures cent) 
August, 1978 140,770 2873 1.8 
September, 1978 135,720 2992 2.2 
October, 1978 158,175 1987 1.3 


(For comparison purposes, several other large-scale sys- 
tems manage to maintain failure rates of less than 0.1 per- 
cent.) 
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ware bugs, application software bugs, and database logical and physical 
inconsistencies. The effect on all three classes of users was serious: the 
operations staff received all the failure messages independent of the 
transaction entry source. They were inundated with paper each day 
and generally were unable to analyze, pattern, or react in any effective 
way. The paper continued to stack up each day so there was virtually 
no real-time support provided. This left them frustrated at their 
obvious ineffectiveness. Each aborted transaction was taking up to 
eight days to evaluate. The database maintenance clerks were contin- 
uing to do their database changes generally unaware which of their 
transactions “bombed out,” because they were not notified by output 
messages or calls from the support staff. Often, they would discover 
these events by later transactions not processing properly, that is, the 
database change may have tried to load some inventory but since it 
never worked, later accesses against this inventory would fail. This 
caused frustration on their part because they would receive calls from 
the application users when their transactions did not work. The 
application users were negatively affected because their ability to serve 
the customers’ needs in a timely way was degraded. All of the users 
who initiated transactions that failed were further frustrated because 
failure messages were not returned to the entry terminal! The lack of 
job continuity and efficiency, because of these transaction failures, was 
having a measurable effect on the acceptability of the system from a 
user point of view. A variety of fixes were designed—the prime effort, 
of course, focused on improving the quality control of the transaction 
processing. In addition, the transaction failure messages (in English!) 
were returned to the initiator’s terminal, and extensive training and 
documentation was developed to help people cope with these events, 
including straightforward error correction procedures. 


4.3.3 Transaction response time 


Transaction response time is defined to be the interval between 
sending the transaction until the first element of the response appears. 
It has many of the same characteristics as availability and transaction 
failures in trying to get a true handle. The software designers are 
sensitive to poor transaction response time perceptions because these 
perceptions imply’ less than optimal programming in many cases, or 
the system administrator may have installed a queueing structure that 
is inefficient from a user point of view or the communications manager 
may have installed a poorly balanced communication layout. 

It should be noted at this point that fast response time is not cheap. 
Neither computer processing power or telecommunications network 
capacity can be indiscriminately increased for fear of negating the 
economics of the system itself. Therefore, extreme care must be taken 
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in specifying response time objectives. In PREMIS, the various trans- 
actions are grouped according to specific users and discrete response 
time requirements are established for each grouping. A table-driven 
queueing structure built into the computer management system sup- 
ports this structure. In addition, communication lines vary from dedi- 
cated to a single terminal to heavy time sharing with others. Martin 
points out—and I agree—that, in practice, justifying response time 
based on pure time economic considerations is virtually impossible. It 
is the psychological factors which can affect user performance. These 
factors include “‘expectance,” “chunking,” “short-term memory,” “clo- 
sure,” etc. For example, experiments have shown short-term memory 
limits are severely stressed when the response time increases. This 
stress results in user discomfort, as well as increased error rate. 
Experiments have also shown that user annoyance is very low if the 
response time is under 8 seconds; some annoyance appears from 8 to 
13 seconds, and the user becomes very annoyed if the response time is 
greater than 13 seconds.” This is very consistent with Service Rep 
reaction on PREMIS. Williams reports on experiments which show that 
for fairly routine data entry tasks “User performance deteriorates with 
system delays of greater than 15 seconds.””’ Other experiments have 
shown a discontinuity at 15 seconds. It becomes more than an annoy- 
ance and disrupter—it becomes a demoralizer and reduces motivation.° 
I do not have statistical data on user error rates in PREMIS as a function 
of transaction response time. However, a recent study by Barber,”’ 
showed that for a response time of, on the average, 8 seconds, the 
subsequent transaction had an error rate of about 0.10; when the 
response time was increased to 20 seconds, the error rate increased to 
0.25. For the case of the Service Rep position, Table II gives recom- 
mendations based on qualitative behavioral observations (mostly my 
own judgmental estimates of disturbance levels). These values are also 
based, unfortunately, on systems without local terminal intelligence so 
that every user action requires full transmission to the computer: 

The recommendations given have the built-in presumption that the 


Table II—Response time objectives 
During Busy Hour 


(90 Percent Confi- 

Action dence) (Seconds) 
Bring-up screen 4 
Paging 4 
Input error (syntactic only) 5 
Input error (all others) 7 
Simple transactions (direct access key to data) fi 
Simple transactions (limited search in data base) 10 
Complex transactions (extensive search in data base) 15 
Transactions allowing parallel tasks 25 


784 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


1 2 3 4 5 6 7 8 9 10 


EXTREMELY (a) VERY 
SLOW; PROMPT; 
INTERFERES WITH SATISFIES 
MY JOB MY NEEDS 

DURING DURING 

THE SLOW WORK 
BUSY HOUR PERIODS 
ei (1% 


1 2\3) 4 5§ 6 7 \B/ 9 10 
(b) 


Fig. 2—(a) Original scale for Service Reps opinion of response time. (b) Revised scale. 


Rep is willing to wait somewhat longer if they perceive the computer 
task to be more difficult (simple versus complex transactions) or the 
system is in a high load condition (busy hour). In other words, the 
user discomfort level is related to the extent to which the real delay 
exceeds expectation. 

Dealing with response time as seen by the user can have some very 
interesting and important quantitative and behavioral effects. Re- 
cently, the PREMIS users In Memphis reported the response time for 
simple transactions to be about 30 seconds, while users in Birmingham 
and Jackson reported a 7-second average. An examination of the 
terminal and communications network layout revealed that there were 
excessive terminals in Memphis competing for the same line. 

A 1979 PREMIS field review in Jackson, from the Service Rep’s 
perspective, yielded the following: 

The Service Rep’s attitude towards response time was obtained via 
formal questionnaires. However, a real-time adjustment to the ques- 
tions had to be made. Originally, the Service Reps were asked to 
indicate their opinions of response time. They were asked to choose an 
answer based on the scale shown in Fig. 2a. 

It quickly became apparent that the Reps were having great diffi- 
culty with this request. Informal probing led to a decision to pose the 
request in two parts (See Fig. 2b.), which ultimately provided a true 
picture when the stop-watch measurements were analyzed (See Fig. 
2b.) 

Stopwatch measurements were extensively taken at three different 
user positions over a one-week period. In addition, the users’ behavior 
was observed, and they were requested to complete questionnaires. 

Tables IIIa, IIIb, IIc, and [Id provide some key statistical summary 
data on response time for the most important transactions. Referring 
to the table, a success, or “hit” condition refers to a user transaction 
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Table IIl—Response time characteristics 
(a) (c) 
Variations across locations (means) Variations across transaction outcomes 
Computer Stopwatch Stopwatch Times 
Terminal Loca- Time Time re 
tions (Mean) (Mean) Mean Mode Ranges 
Hits (second) A 4,39 8.87 Hits (second) 10.68 6.0 5-60 
B 3.90 12.09 No-hits (second) 
C 4.80 10.38 Outcomes 
Average 4,27 10.20 1 9.61 9.3 5-19 
No-hits (seconds) A 5.93 8.15 : ee | ie 
B 6.62 13.63 4 1168 3.0 5-9] 
Cc 8.45 11.35 ’ 7 
5 11.86 7.0 6-27 
Average 6.56 10.65 6 9.65 7.0 6-18 
7 11.43 11.0 7-26 
(b) 8 10.00 6.5 5-23 
9 29.64 26.0 22-54 
Variations across locations (details) 10 11.23 7.0 6-30 
Termi- Stopwatch Timings 
1602 SSS 
cations Mean Mode Ranges (d) 
Hits (second) A 8.87 6 5-29 lati i 
a ns aes Variations across business day (means) 
C 10.38 . 6 5-39 9-10 10-11 10-12 12-1 1-2 2-3 3-4 4-5 
No-hits (second) A 3.15 7 5-39 am am am pm pm. p.m. p.m. p.m. 
B 13.63 li 6-54 Hits (second) 6.3 102 125 70 87 #115 75 °£481 
Cc 11.35 12 5-35 No-hits (second) 106 175 12.2 88 93 98 99 91 


which returned the requested information to the user; a “no-hit” 
condition implies an undesirable result usually accompanied by an 
error message. The response time objective was 7 seconds. Table IIIa 
shows the variation in mean response time across users’ locations for 
both user response time (measured by stop watch) and computer 
response time (automatically measured by the computer, representing 
only the main frame residence time). Note that the use of the computer 
response time as the only official reportable number had provided a 
false sense of security. The variations between user sites is significant; 
therefore, a more detailed statistical breakdown is shown in Table 
IIIb. (It turned out that the computer polling algorithms were out of 
balance causing these variations). The response time variability be- 
tween transaction outcomes is shown in Table III (one specific result 
was the redesign of the software processing for the ninth no-hit 
condition). Finally, Table IIId shows the variation in response time 
across the business day. It showed the longest response times before 
and after lunch time which is consistent with the known customer 
activity level variations (this information resulted in recommending to 
the maintenance and computer center staff that offline transaction 
activity be avoided during these time periods). 

Based on my observations and later interviews, I believe the wide 
variability in response time for different outcomes was apparently at 
least as troublesome to the users as the mean response time. This is no 
longer surprising based on several recent experiences and experiments 
appearing in the literature. First of all, what variability is detectable 
by the users? A recent experiment by Butler” showed that for a 
response time in the 8-second range, a 2-second increase was consis- 
tently detectable by 75 percent of the subjects. Now, in terms of 
discomfort, Miller notes “Increasing the variability of the output 
display rate produces both significantly decreased human performance 
and a poorer attitude toward the system and the interactive environ- 
ment.” The results of this experimentation showed that user satisfac- 
tion, measured on a relative scale, decreased by 25 percent as the 
response time variability went from low to high. In addition, the 
amount of time required by the user to analyze the output and take 
action increased by 15 percent under the same conditions.° In addition, 
not preparing the user for this compounds the negative effect. Shnei- 
derman, in reviewing several experiments, states “If the variance of 
response time is small, users incorporate the waiting into their work 
patterns by pre-planning future queries or attending to other functions, 
but if the variance is large, users must maintain a continued high level 
of awareness and become anxious if response time grows.” Quantita- 
tively, Martin recommends as a rule of thumb that the standard 
deviation of response time should not be more than half the mean 
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response time. Therefore, for an 8-second mean, two-thirds of the real 
response time should be less than 12 seconds. 

Finally, a condition with both flaky transaction failures and slow 
and erratic response time is more damaging to user performance than 
the sum of each effect. A recent review of PREMIS, which had such a 
combined condition, disclosed high-user frustration at not knowing if 
a blank screen meant something wasn’t working or the system was 
just slow. On-site observations showed that they would often send the 
same transaction several times in succession, thereby compounding an 
already overloaded input queue. This phenomena of an induced load 
was supported by data on audit tapes. An analysis of the data” 
revealed that as the system approached the busy-hour, users repeat- 
edly hit the send key in their frustration at the slow response time. 
The Service Reps openly admitted that they were doing this. Figure 
3 shows the measured response time characteristic versus offered load 
for the system as it existed in September, 1980. The effect of the 
induced load on response time is shown for a particular Monday 
morning busy hour, where a serious computer problem caused repeti- 
tive entry of very long running transactions. In these cases, each 
transaction was, in effect, worth many normal transactions. It is 
estimated, based on audit tape data analysis, that this user-initiated 
phenomena combined with transaction processing difficulties in- 
creased the average response time from a normal 10- to 20-second 
range to a 60- to 70-second range. 


V. TARGET ORGANIZATION/ENVIRONMENT CONSIDERATIONS 


It should not be surprising that the performance of a very large 
computer-based application can be greatly affected by (and can greatly 
affect) the target organization/environment. Every attempt should be 
made to analyze this impact up front and force appropriate adjust- 


80 
70 
60 
50 
40 


30 


TIME IN SECONDS * 


20 


MEAN TRANSACTION RESPONSE 


10 
>| EXPECTED 
RANGE 


0 
0 10 20 30 40 50 60 70 
SYSTEM OFFERED LOAD IN TRANSACTIONS PER MINUTE 





Fig. 3—Load characteristics. 
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ments in the organization or environment to allow a graceful introduc- 
tion of the computer system. The first consideration—the relationship 
of the computer tasks to their appraisal through existing job perform- 
ance measurements—is extremely important and often underesti- 
mated, especially for the application user. This can be the focal point 
of serious user performance problems. The Service Rep job is one such 
case. 


5.1 Relationship of existing user job performance measurements to use 
of the computer 


There are three major possibilities here: 
(t) Does the Rep perceive the computer system to have a positive 
effect on their performance measurements? 
(zz) Are the measurements themselves tied to bottom-line system 
performance objectives? 

(zit) Are the Reps provided feedback on how well they are doing? 

Assume a Service Rep is negatively measured, at least in part, if the 
customer contact is broken off for any reason and the customer is put 
on hold. In a pre-PREMIS environment, paper records are accessed 
during the contact; with PREMIS, the computer is accessed via the 
terminal. The computer-based records may be more accurate than the 
paper records. This guarantees better service to the customer which 
presumably has positive satisfying value to the Rep. But the more 
significant behavioral influence on the Reps is the effect of the com- 
puter on their job performance measurement. So, for example, if the 
system response time is so slow that Reps believe that customer 
contact breaks will increase, they are going to be unhappy. 

An on-site review of the Service Rep’s use of PREMIS was held in 
Jackson, Mississippi, in 1979. Under the existing measurement struc- 
ture at that time, breaks in customer contact played an important role. 
That is, Reps with frequent and lengthy contact-time breaks would be 
negatively effected. Personal interviews with the Reps included the 
question, “How do you perceive the effect of PREMIS on your average 
contact time? Their verbal responses generally showed a keen sensi- 
tivity to the issue and a uniform distress over the transaction response 
time degradation in the busy hours (the observation of response time 
as a sensitive behavioral characteristic will be further amplified in the 
next section). There is no easy way out of this without cooperation of 
the target organization. A slight increase in contact-time breaks and 
break durations may, of course, be a small price to pay if a substantial 
reduction in key information errors occurs which ultimately translates 
into reduced total work or some other bottom-line measure. One such 
measure is input error frequency. 

A data collection summary” of the effect of PREMIS on the frequency 


HUMAN PERFORMANCE ENGINEERING 789 


of errors in an important parameter makes this point. One of the items 
that a Service Rep must determine is the address of a customer who 
wants service so that the electrical connections can be assigned and 
hooked up to the right house, as well as provide the downstream 
accounting people with the right place to mail bills. The Service Rep 
places the address on a contact memo which later is used to generate 
the formal service order. If the address is incorrect, a variety of 
difficulties can develop, depending on how long the error goes without 
being detected. If the error is caught early, the order is sent back to 
the Rep to change and re-submit; if the error succeeds in surviving the 
entire service provisioning process, billing discrepancies may occur. In 
any case, there is substantial cost associated with each of these errors. 
One objective of PREMIS is to provide the Rep with on-line address 
verification support during the customer contact, thereby avoiding 
address errors up front. Table IV summarizes the history of address 
error statistics since PREMIS was cut live in Jackson, Mississippi. The 
precut-live error rates were on the order of 10 percent in 1978. 

It is generally acknowledged that PREMIsS—both from an ease-of-use 
and a database-accuracy point of view—allowed this dramatic im- 
provement. 

The key is to revise the job performance measurement closer to the 
bottom line instead of to a secondary parameter. This is easier said 
than done, especially if this condition is not recognized before the 
computer system is installed. 

Recently, the Service Rep job performance measurements were 
changed. The emphasis is now on customer service rather than on the 
contact time. That is, questionnaires on how gracefully and efficiently 
the job is being done, from both a behavioral and efficiency point of 
view, are now being sent directly to customers on a per-Rep basis. If 
an address error was not caught early and proliferated all the way to 
the installation step, the customer would be negatively affected (by 
the missed appointment) and would so report. This change of emphasis 
on individual performance measurements was implemented totally 


Table !\V—Address error trend 


Orders 
With Ad- 
Total Ser- dress Er- 

Quarter vice Orders rors Percent 
1Q79* 08,707 3,919 6.5 
2Q79 77,746 4,570 5.9 
3Q79 80,800 4,154 5.1 
4Q79 69,217 3,305 4.8 
1Q80 68,717 2,730 4.0 
April-May 47,167 1,388 2.9 


* Premise begins live operation. 
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independent of PREMIS; however, the beneficial effects to PREMIS are 
obvious. 

Of utmost importance is the direct feedback to the user of both the 
performance objective and the user’s personal performance. In the 
famous Hawthorne investigation on worker productivity, the investi- 
gators found that the experimental subjects demonstrated improved 
performance regardless of the variables. Many years later, H. M. 
Parsons concluded that the two major influences on the workers were 
({) they were given frequent feedback on their performance, and (iz) 
they were aware that their performance was directly tied to their 
salary increases.” 

In addition, the Service Rep is now handling new functions on a 
PREMIS environment. The associated new tasks, of course, take addi- 
tional time; but from a total organization point-of-view, the functions 
are being performed more efficiently. This very important considera- 
tion is discussed next. 


5.2 Relationship of the existing organization-to-function mapping to an 
optimal strategy 


In an organization/environment where the work is broken down into 
several discrete functions performed by discrete work groups, the 
flexibility to re-distribute certain tasks across traditional boundaries 
can often improve the effective use of a computer system. A case in 
point are the traditional organizational entities in the Boc, where 
several vertical management structures each are involved in the proc- 
essing of a horizontal customer service order. To illustrate the example, 
consider the roles of three of these organizations: The establishment 
of the service order (Service Reps); the assignment of facilities 
(assigners) and; the installation of the facilities (installers). PREMIS 
provides on line, up-to-date information up front to the Service Rep 
associated with the service history at a particular address. Now, one 
specific issue for which PREMIS can provide benefits is the interfering 
station condition. Simply put, it is a request for new telephone service 
at an address where previous service has not been terminated. This 
most often occurs when the new customer’s request for service is 
processed before the processing of the disconnect service order of the 
existing customer at the same address. Before PREMIS, this condition 
was often not detected until the installer goes to the address, thereby 
causing significant extra work by the Rep, assigner, and installer. With 
PREMIS, the information about the existing customer is displayed on 
the screen while the Rep is negotiating with the new customer. Now, 
if the Rep’s job can be expanded to resolve this condition right up 
front, significant wasted work can be avoided downstream; that is, a 
modest increase in the Rep’s work load is certainly justified from a 
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Table V—tInterfering station condition (isc) workload analysis. As- 

sumptions: Percent of inward service orders for which an Isc exists 

is 7.3%; percent isc identified by Service Rep with PREMiIS is 92.0%. 
Workload Characteristics Service Rep _ Assigner Installer Other 


Average wasted work effort 12 min 20 min 45 min 10 min 

per ISC occurrence not de- 

tected by Service Rep 

ISC 
Identified by: 
Assigner Installer Other 

Annual benefit per line with 4.7¢ 44g 0.56¢ 

PREMIS 
Annual expenses per line with (1.6¢) 


PREMIS because of increased 
Service Rep work efforts 
Annual benefit* (cost) per ($64,000) $188,000 $176,000 $22,400 
year with PREMIS for 4 mil- 
lion line entity 


* Annual benefits per line are linear. 


corporate point-of-view if a much larger savings is realizable by the 
assigner and installer. An economic study by Hafelfinger,” based on 
on-site observations, interviews, and several months’ worth of data, 
convincingly demonstrated this. South Central Bell initiated new work 
flows in a PREMIS environment across the relevant organizations, 
thereby allowing pre and post data to be gathered. Table V summarizes 
the key results. | 

Thus, per PREMIS system, almost a quarter of a million dollars can 
be saved yearly by optimizing the task requirements across organiza- 
tional boundaries. (It is believed that these figures are conservative 
and much more savings is likely.) 

It is important that the system design and development plan include 
an activity associated with the optimal re-association of functions 
across organizational boundaries. The best vehicle and methodology 
to accomplish this depends on the relationships of the system design 
and customer organization and the complexity and breadth of the 
mechanized functions. A vehicle that has proved effective for PREMIS 
is the formal establishment of a working committee of several BOC 
representatives working under the auspices of the AT&T Residence 
Service Center, with consulting support from the Bell Laboratories 
PREMIS HPE group. The committee recommended a generic approach 
to the changes in existing job functions in the Residence Service 
Center to best use PREMIS. Detailed before and after work flows formed 
the basis of the method used in this activity and proved to be an 
effective device.” An adjunct method—to be used in conjunction with 
work flows—has also shown some advantage for certain environments, 
i.e., the formal groupings of like tasks into a series of work modules.”’ 
These modules have characteristics as follows: 
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(t) Not splittable between people. 
(ii) One (or more) make up a person’s job. 

(tit) Each can be independently assigned to the appropriate group. 

The strategy is to define the work flows (and work modules) as part 
of the system design process early enough to influence the adjustments 
in the target organization’s job structure. The work modules can also 
be used as the basis for organizing procedural user documentation and 
training, independent of job definition. 

Make no mistake, there is a real threat to the success of a computer 
system if the organizational dynamics are not well understood and 
dealt with. The worst possible condition exists when the computer 
system is almost totally supporting one department but dependent on 
another to increase its workload to make the system work effectively 
in a mechanized environment. T’o ensure an effective working system, 
an early corporate, multidepartmental commitment is vital. It has 
been my experience that when it is shown that additional human effort 
In one department can save much more human effort in another 
department and improve the corporate bottom line, with the organi- 
zation doing the extra work getting the credit and a good public 
relations job is done, then the chances of acceptance will be improved. 
This was the case of the Isc discussed earlier. On the other hand, if 
workers in one department are asked to increase (or even change) 
their jobs to make it easy to program a computer helping another 
department there is extensive resistance. 


5.3 Customer interface 


Another potentially important environmental influence on user per- 
formance is the customer interface. This is appropriate when the 
performance of the user tasks includes interfacing with the customer 
of the service being provided. This condition exists for the case of the 
Service Rep where the customer is initiating action with respect to 
obtaining (or changing) telephone service. 

An earlier example discussed the potential problem of the target 
organization using customer contact time as a measure of a Service 
Rep’s job performance. This presumes direct involvement with the 
customer over the telephone. | 

In the telephone work mode, the Service Rep is visually interpreting 
screen responses from the computer, while simultaneously listening to 
the customer provide additional data all under contact-time pressure. 
We know from experimentation” that there are load stress bounds 
beyond which there can be some degradation of human performance 
if, during a person’s job, he or she is time-sharing between auditory 
and visual inputs. 

PREMIS is also being used in an environment where the Rep is face- 
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to-face with the customer. Now the behavioral damage to the Rep of 
a slow computer response time is compounded. The visibility of the 
interaction to the customer has been dramatically extended, thereby 
introducing a “‘fish-bowl” effect on the user. Over the phone, perhaps 
the Rep can disguise the delay by gathering additional information for 
a time, and then, out of desperation, discuss the weather or last 
Saturday’s football game. In a face-to-face contact, I have seen the 
Rep and the customer staring nervously at a blank screen for a time 
that seemed like forever, but which was only for about 20 to 25 seconds. 
This was very disturbing to the Rep, especially if the response time 
increased during the busy period, with other customers waiting in line 
(also watching the blank screen). 

Corrective strategies to minimize the pain in this environment had 
to do with adjusting the work station layout and changing the work- 
flow steps. When possible, screens were not made visible to the 
customer. This was relatively easy to do when the customers and Rep 
were separated by a counter or barrier. For layouts where there is free 
flow of customers throughout the work area (as in many Phone Center 
stores), the terminals were put off in a corner and accessed privately 
by the Rep after he or she had collected the pertinent information 
from the customer on a scratch pad. 


VI. SUMMARY 


The introduction of a very large computer-based application system 
to an existing work environment provides the opportunity for enhanc- 
ing the application user’s job performance. However, an awareness is 
necessary of the special computer considerations associated with the 
organization/environment pressures, the interface with the customer, 
and the user’s own behavior. The features of the mechanized support 
should be carefully tailored to make the human/machine interface as 
effective as possible. Principles used in the functional and operational 
design of PREMIS—with the Service Rep as the gc Ls sim- 
plicity, consistency, and clarity. 

The field experience gathered over the last two years has shown that 
certain operational aspects of PREMIS can have a cumulative, very 
potent negative effect on user performance. As reported by the Reps, 
these include “flaky” availability, slow response time, and a cumber- 
some log-on procedure. Also, the Reps report that the time to complete 
the functions and the level of task difficulty were not decreased by the 
introduction of PREMIS. Balancing this, the Reps feel that the system 
has allowed them, as well as the rest of the workers, to serve telephone 
customers more effectively and efficiently with higher confidence and 
less errors. The general positive influence of PREMIS—as seen by the 
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Fig. 4—Service Reps response to “How has PREMIS affected your job?” 


Service Reps—was apparent in the final question of their most recent 
field review questionnaire, “How has PREMIS affected your job?” with 
the average score circled (see Fig. 4). 

In my judgment, this positive reaction is in large part because of a 
systematic influence in the system design decisions of conscious human 
performance engineering principles. In addition, a total system engi- 
neering view must be taken of all design features so that the user 
benefits are not outweighed by hardware/software costs. 

The time is past where large-scale computer systems can be initially 
delivered, found to be unacceptable from a user point of view, and then 
virtually re-done. The applied technology of human performance en- 
gineering has reached the stage in maturity and know-how to influence 
the a priori system design and help achieve a successful initial product. 


Vil. CONCLUSION: A SYSTEM DESIGN PERFORMANCE AID 


The following techniques work when designing computer systems 
that are easy for people to use: 

(1) Have HPE people perform an early task analysis. This can 
substantially avoid unworkable or unreasonable user performance 
requirements. Divide tasks between people and machines, making the 
best use of the strengths of each; for example, allocate simple, repetitive 
jobs to the computer and complex, judgmental jobs to people. 

(zz) Develop work flows early to ensure that the work will get 
done in a logical, workable sequence. 

(zz) Design each system in an incremental fashion—both from a 
software-architecture (internals) and user-transaction (externals) point 
of view. 

(tv) Involve the existing target organization in the system design 
process to help determine appropriate work shifts to achieve optimum 
use of the mechanized assist features. If certain work groups had added 
tasks, make sure they will get credit. 

(v) Understand the impact of the computer system on the exist- 
ing work performance measurements of individuals and groups to 
avoid behavioral resistance. 
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(vi) Design the human/computer interface and transaction com- 
mand structure to be consistent and simple—simple to use and simple 
to learn. . 

(vit) Allow the user to initiate and control all human/computer 
interactions. 

(vit) For a computer system without local intelligence, use a 
batch-prompted (with mask) input mode. 

(tx) Limit inputs and outputs to a single display frame (page). 

(x) Design output screens to allow over-typing of input com- 
mands to request the next transaction. 

(xi) Return meaningful messages in English to the user for all 
possible transaction outcomes. 

(xii) Incorporate computer-assist features (menus, prompts, de- 
faults, minimal input) but only after task analysis is done to ensure 
that these features do not induce unnecessary transactions causing 
computer overload, as well as forcing users to take longer to complete 
their tasks. Perform iterative analysis of alternate designs to establish 
a quantitative computer/human performance balance. 

(xiit) Make system-to-system switching from the same terminal an 
easy operation. 

(xiv) Make terminal function keys and dialogue pnenomics con- 
sistent among systems. 

(xv) Incorporate ease of use principles in information display 
design (data displayed in actual work sequence, data preservation 
across outputs, labeling, etc.). 

(xvi) Make all user messages constructive and supportive, not 
condemning and confrontive. 

(xvii) Establish quantitative system performance objectives and 

computer costs for 

a. Transaction Response Time by transaction class, processing 
type, and outcome 

b. Total System Availability number of occurrences, as well as 
total time 

c. Transaction Failure Rate by transaction class. 

(xviii) Establish quantitative human performance objectives and 
costs for 

a. Human error rate by transaction class 

b. Training time 

c. Work time. 

(xix) SELL THE SYSTEM. Use any means—user group meet- 
ings, presentations, deliverable documents, informal visits—to con- 
vince the target population, top down and bottom up, that the addition 
of the computer to the work environment is positive in terms of Job 
effectiveness and user satisfaction. 
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Graphic Displays of Combined Presentations 
of Acoustic and Articulatory Information 
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Articulatory data have been collected by a computer-controlled 
microbeam system. The locations of lead pellets placed on a speaker’s 
articulators are recorded as x, y coordinates along with the acoustic 
signal and a train of timing pulses which synchronize the frames of 
X-ray data with the acoustic signal. Some 830 utterances have been 
recorded to date, resulting in a database of nearly 120 million bytes. 
We describe the graphic techniques by which those data are exam- 
ined; namely, a simultaneous display of articulatory features such as 
velum lowering, lip and jaw closure, and tongue motion, together 
with the spectral information of the corresponding speech. These 
displays are also annotated with phonetic transcriptions obtained 
automatically by pattern-matching techniques. Acoustic representa- 
tions of speech signals, in a simplified spectrographic format, are 
contrasted with articulatory measures, and implications are dis- 
cussed concerning the difficulty of the automatic recognition of speech 
via conventional input processing. 


l. INTRODUCTION 


Articulatory data specified by x,y coordinate positions of lead pellets 
located on the velum, tongue, lip and jaw, all in the midsagittal plane, 
have been collected by a computer-controlled X-ray microbeam sys- 
tem. The corresponding speech signal and a sequence of timing pulses 
which synchronize the frames of articulation data with the acoustic 
data are also recorded. This system, which operates at the University 
of Tokyo, has been described in detail by Fujimura, Kiritani, and 
Ishida’ and Kiritani, Itoh, and Fujimura.’ In a typical data collection 
session an utterance lasts about six seconds, and approximately 700 
frames of coordinate data on eight pellets are obtained. If fewer pellets 
are used, the frame rate will be higher since pellet positions are 
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determined sequentially for each frame. The waveform of the acoustic 
signal is quantized into 12-bit samples at a 10-kHz sampling rate, and 
each frame of pellet data is marked by an integer pointing to the 
corresponding acoustic waveform sample. All quantities, i.e., pellet 
coordinates, samples of the acoustic wave, and frame pointers, are 
stored as 16-bit quantities, and thus a 6-second utterance requires 
approximately 144,000 bytes. To date there have been some 830 
utterances in two languages—American English and Japanese by a 
total of six different speakers (one female). The resulting database 
totals nearly 120 million bytes. 


ll. ON-LINE CRT DISPLAYS 


It is important in dealing with a large database that it be possible to 
retrieve and examine the data easily. The initial methods employed 
have been of an on-line, interactive nature. Utterances are identified 
by a 5-digit number, which indicates the data-collection session and 
position within the sequence of utterances recorded. Specification of 
this ID number at the computer console results in the retrieval from 
disk storage of all data pertinent to the utterance. Then a time window 
is selected under knob control for a crT display of the time traces of 
the pellets, together with the RMS envelope of the speech wave. A copy 
of such a display is shown in Fig. 1. The ID number and the first and 
last frames of the window are specified on the top line. The text of the 
utterance is printed below, and average values of the pellet traces, 
together with their two-letter designators, such as BY for the y (up- 
down) coordinate of the pellet on the tongue blade, are listed on the 
right for up to six coordinates. The RMS envelope of the acoustic signal 
is Shown at the bottom. Alternatively, an x-y coordinate map showing 
trajectories of movement in the pellet positions is displayed for the 
frames included in the time window. (See Fig. 2.) The window size has 
been cut down to the 45 frames spanning the portion of the text 
indicated by the bracket. The trajectory of each pellet begins at the 
point marked by an arrow and the final position is marked by an X. 
This type of display very clearly reveals the velum lowering for the 
nasalization, as well as the complex activity of the tongue associated 
with this portion of the sentence. The vertical dots represent the 
position of a cursor, which can be moved by a knob through the 
windowed frames. On-line digital-to-analogue facilities permit listening 
to the associated sound for time intervals of varying lengths. Generally, 
these display features have proved very useful in studying the data, 
but there is an inherent inadequacy in the representation of the speech 
signal by waveform envelopes. To combat this shortcoming, we have 
supplemented the output facilities with a spectrographic display of the 
acoustic information. 


800 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


Sects tyres ces aed ori te 
Soe 1 SC 


BEN ANNOUNCED THAT AN INNOCENT-SEEMING INFANT HAD NIMBLY NABBED MOST 


ee 
‘wt - zi, 
“em * 


ese 
“ ce 
a. A \ pe NT PSN oN. Z . Te, im 
ae, \ 7. Lo ey 


. a! 
wa J t 
ON an, prs ‘i > ye 
: * a af Pe “s 
: . : id . = oe 
‘ ‘ : a : ? 
cy fy —™ : ya , Hy 
Mone ~~” bad V Oe is 
Paes a, oer 
‘ PS f oe ee os 
O w if . o 4 oS. 
eal Ss . : . / 4 4 
An aN pent ge 74 ‘. 0 wen! a 
YOON ee \ < Li 
a eS 
* ~ . cae a a4 
ag Oe aoe n — : : 
: te eas ene “~ 
ol, : é : we, Ly IN, v 
Vv + : ‘ a 72 E ‘ 
VW ¢ .* - . 
N oN ~ _ 
ra m 
. * ey . ee, A“ ake 
. : 3 z 
“om epee al Ss oN : _/ = Le ~*~ : 
rT v7 vv . Mn 
tw wv al : i é 
—™ : | 
) 
iN, s 
‘ aw. aie Na a Sc | = = i ; 
’ ‘er on“, 
NAA Nee we” “~L bP " 
« 4 
. oN ‘a * a » Ase 
- os one wae ¥ 2 z 
. . * “oe of N . nan s% 5 P, 
e N. . eo oe fa \y 2 
Set Soon en oe YS oi: | ae 
. ,? 7 oe - . as \ 
. © 7 v he x Pad ss a 
: . 


Fig. 1—On-line display of time traces. 


lll. HARD COPY DIGITAL SPECTROGRAMS 


To obtain the resolution necessary to generate an eight-level gray 
scale, we have employed a Versatec plotter having 200 rasters per inch. 
We define a super pel or picture element as a matrix of 4 by 4 rasters 
and let the number of black rasters out of the sixteen in each super pel 
be a nonlinearly increasing function of the gray level. Specifically, for 
levels 0 through 7 the number of black rasters is 0, 1, 2, 3, 4, 7, 10, 16. 
(See Fig. 3.) The original resolution of 200 rasters per inch is thus 
decreased to 50 pels per inch, but the resulting gray scale proves fairly 
uniform and is quite sufficient to display signal intensity in the fre- 
quency bands of the digital spectrograms. Figure 4 illustrates an 
example that adopts the time-frequency proportions of the traditional 
spectograms.” 

These displays are produced by using a raised cosine on a time 
window of 100 samples, or 10 milliseconds, and computing a spectral 
slice by fast Fourier transform techniques every 40 samples, or 4 
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milliseconds. Each slice is “boosted” by a linear function to raise the 
high frequencies, and all amplitudes above an adjustable threshold are 
linearly mapped onto the eight gray levels. Having a resolution of 50 
pels per inch, a sampling rate of 10 kHz and taking slices every 40 
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samples results in a time scale of 0.2 second per inch. In Fig. 4, one 
inch of height (on the actual computer output) represents 1000 Hz of 
frequency. It has been found, however, that great economies in paper 
space can be achieved by shrinking the frequency (vertical) scale, 
while leaving the time (horizontal) scale unchanged, and this deviation 
from the standard visible speech format does not adversely affect the 
usefulness of the spectrogram. (See Fig. 5 in which there are four 2- 
second lines of spectral data placed on one page as the result of such 
frequency scaling.) We find that if we cut the height down to as little 
as 0.8 inch, we are then able to display a speech spectrogram, together 
with as many as twelve time functions of pellet coordinates on a hard 
copy output. (See Fig. 6.) Such a display is substantially more useful 
than the RMS envelop (as in Fig. 1) for identifying phonetic events in 
the utterance being examined. 

A display of this type can handle at maximum a time window of two 
seconds, which, at typical frame rates of 120 per second, is approxi- 
mately 240 frames. The time scale is marked off both by frames (every 
tenth) and by milliseconds (every 200). Although computed while 
running the display program on-line (at about 100 times real time with 
our present CPU), this output is generated off-line on the Versatec 
plotter. 


IV. OFF-LINE OVERVIEW DISPLAYS 


It is frequently of considerable convenience to both prepare and 
examine the data away from the computer, and in so doing, it is 
advantageous to have a complete representation of as much as an 
entire utterance on the same sheet of output. The example shown in 
Fig. 7 meets these requirements. As an off-line or “background job,” 
we are able to prepare on the Versatec plotter an 8-1/2 by 11 inch 
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Fig. 6—Time traces and digital spectrograms. 


page divided into three horizontal sections, each of which spans two 
seconds, plus 0.1 second of overlap into the next section. The sections 
contain ten stripes representing articulatory time functions coded in 
the eight-level gray scale and plotted in time synchrony with a spec- 
trographic display of the acoustic data. Timing marks at 200-millisec- 
ond intervals are also indicated. | 

The ten gray-level stripes which describe the articulatory events are 
arranged as follows: There is a top set of four stripes indicating front- 
back or horizontal motion for, from the top, the lower lip and three 
tongue pellets (blade, mid and posterior position on the tongue sur- 
face), and a lower set of six which portray vertical motion for lower lip, 
mandible, three tongue pellets, and the velum. The correspondence 
between stripes and articulators may be determined by associating the 
stripes ordered from top to bottom on the page with the articulators 
ordered from front to back in the speaker’s head. The range of 
coordinate values for each pellet is linearly mapped into eight levels of » 
gray such that the minimum is the first level or white, the average 
value is level five or medium gray, and large values are level eight or 
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Fig. 7—Off-line display of articulatory and acoustic data. 


black. That is, in Fig. 7 black corresponds to up for vertical motion 
and front for horizontal motion. 


V. FEATURE-ENHANCED DISPLAYS 


As an extension of the display of the type shown in Fig. 7, which 
serves as a record and check on the database, we produce others of the 
type shown in Fig. 8, designed to emphasize articulatory features and 
contrast articulatory and acoustic events. In these, we reverse the 
mapping onto the gray scale for vertical motion of both the velum and 
jaw pellet and horizontal motion of the lip. As a result of these 
reversals, for example, the nasalization produced by a lowered velum, 
stress correlated with the minima in the jaw position, and the conso- 
nantal gesture indicated by lip constriction and a raised or advanced 
position of the tongue are easily spotted as black regions in the stripes 
(see below). It is an option of the program to produce the output with 
the gray scale for the stripes used in reverse so that when the displays 
are reproduced as transparencies the stripes can be individually col- 
ored. Thus, the features can be readily identified by a color coding. 

It will be immediately noticed that the spectrographic representation 
of the acoustic information has been substantially modified. It has a 
checker-board appearance due to the frequency range having been 
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Fig. 8—Feature-enhanced display of articulatory and acoustic data. 


grouped into eight bands. This output was produced by calculating a 
Fourier transform every two milliseconds and taking the maximum 
amplitude values obtained from three adjacent spectral slices. The 
maxima are used as an estimate of the spectrum at time points spaced 
every four milliseconds. The time window for the Fourier transform is 
five milliseconds; hence, the frequency resolution is 200 Hz, and the 
first twenty frequency components span a usable frequency range of 
4000 Hz. These components are grouped into eight bands centered 
around 200, 400, 600, 900, 1300, 1800, 2500, and 3500 Hz, respectively. 
The average output magnitude in each of these bands is then mapped 
onto the gray scale. It will be noted that with such a grouping, the first 
formant and the nasal formant will be found in bands 1 to 4, the second 
in 4 to 6, and the third in 6 and 7. Voiceless fricatives and stops show 
up often in bands 7 and 8. 

The black, gray, and white stripe plotted below the spectral display 
indicates the voiced, unvoiced, and silence (VUS) coding of the acoustic 
signal as determined by the Atal-Rabiner algorithm.* 
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Finally, a processing due to Nelson?” utilizes vus coding and infor- 
mation on jaw minima to locate phrase boundaries, and then identifies 
occurrences of specific articulatory events that are predicted from a 
feature interpretation of a sequence of phonetic symbols obtained from 
a manual transcription of the utterance material. Each symbol is 
positioned automatically, according to these pattern identifications, 
and the resulting symbol alignment is shown in Fig. 8. A vus stripe 
and phonetic annotations will also appear on displays of the type 
shown in Fig. 6 and 7, provided the utterance has been processed with 
the coding algorithm and the Nelson pattern-matching scheme. 

The display of the acoustic data in Fig. 8 represents a reduction by 
a factor of five over the traditional spectrogram on information to be 
stored; that is, from 40 harmonics to eight bands per spectral slice 
(albeit at the expense of more computation for the particular methods 
we use). But more important is its very “block-like” appearance, which 
in contrast to the smoothly fluctuating “grayness” of the articulation 
stripes, reveals apparent “segmentation” of the time events. These 
blocks make it easier to evaluate the success of the phonetic alignment. 
Also, since the frequency bands are selected considering the human 
auditory characteristics, the spectral representation may be similar to 
what the human perceives. 

The marginal notations in Fig. 8 point out examples of the feature 
enhancement mentioned above. The utterance in this case is “Ben 
announced that an innocent seeming infant had nimbly nabbed most 
of the bananas.” The nasalization occuring in “Ben announced” is 
indicated by the black on the bottom of the pellet stripe set. There are 
numerous other occurrences of nasalization in the sentence also 
marked by black in the bottom stripe, except where pellet tracking 
failed during the recording session. The missing pellet data are indi- 
cated by the thin line in an interval of approximately 600 milliseconds 
beginning about two seconds into the utterance (end of first section— 
beginning of second). Stressed words are revealed by black in the jaw 
stripe—second from top in a group of six. The feature of stress is also 
accompanied by a lowering of the lip which is indicated by white in 
the top stripe of the six, unless the vowel shows lip constriction. Hence, 
in most cases the white-black combination of these two stripes serves 
as a very clear cue for the stressed sounds. 

Since the horizontal motion of the lip 1s plotted in the top stripe of 
the set of four, with black indicating retraction of the lip in this type 
of display, and since vertical motion is plotted in the top stripe of the 
set of six, labials such as /b/ and /m/ have black only in the set of six. 
However, the fricative /v/ in which the lower lip is retracted would be 
identified by black in both stripes. Color coding in which the same 
color is used for each articulator in both horizontal and vertical 
dimensions is helpful in directly identifying such articulatory features. 
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Apical consonants, namely, /t/, /d/, and /n/, are produced with the 
tongue tip and blade high behind the teeth and, hence, are revealed by 
black in the stripe third from top in the set of six. 

The palatal sounds produced with a high mid-position of the tongue 
are similarly marked by black in the fourth stripe. All these articulatory 
characteristics are utilized in Nelson’s automatic annotation. 

We can with these displays go beyond the mere identification and 
location of these articulatory events and observe interesting time 
relations between the movements of the articulators and the resulting 
temporal patterns of the acoustic output. For example, we observe the 
very early onset of the lip closure for /b/ at the beginning of the 
sentence. Nasalization for the nasal consonants in the beginning phrase 
extends continuously over several syllables. We can also see how the 
labialization trails the vowel glides that cause the gesture as in “an- 
nounced” and “most,” where the lip constriction extends well beyond 
the bounds of the vocalic segment. Palatalization can be seen to extend 
through all the front vowels in the words “seeming infant,” unaffected 
by the intervening consonants. We note finally the state of the artic- 
ulators at the conclusion of the sentence and contrast the slow but still 
changing activity during this silence with the fast, yet smooth, move- 
ment during the sound production. 


VI. CONCLUDING REMARKS 


In conclusion, we point out that the gray scale facilities have made 
it possible to reveal a great deal of information on a single page of 
output and to be freed in many cases from the necessity of on-line use 
of the computer as a means of data inspection. Furthermore, adequate 
use of the gray scale has made it possible to detect important features 
in the events and interesting relationships between the acoustic and 
articulatory events. Such spectrographic information as we observe in 
Fig. 8 is typical of the type of input used by an automatic speech 
recognition system; although, the coarse sampling used by most sys- 
tems often eliminates the sharp discontinuities we see here. In this 
connection, it is interesting to note that the acoustic information as 
shown here is in many ways complementary to the articulatory infor- 
mation shown in comparison. Individual consonantal gestures, partic- 
ularly in terms of place distinction (e.g., /p/ as opposed to /t/ or /k/) 
are obvious by articulatory measures, but often very difficult to identify 
via acoustic signals. We need more complex details, such as continuous 
formant transition patterns seen in Fig. 7, or some ad hoc and context- 
sensitive feature detection strategies based on such details, for acoustic 
identification of phonemic characteristics. While the articulatory ges- 
tures are sluggish and asynchronous, their phonemic correlates are, for 
the pertinent dimension, reproducible and often nearly invariant.°® 
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After having identified the real invariant characteristics in the articu- 
latory aspects that are only indirectly observable in the acoustic 
signals, we can ask proper questions as to how we might process such 
acoustic signals most effectively to derive the necessary phonetic 
information. To the extent we are impressed by an acoustic represen- 
tation as in Fig. 8, we can identify the phonetic message by the present 
machines. To the extent we have to supplement with the other 
representations, such as direct articulatory or abstract pattern match- 
ing of more detailed acoustic information, effective speech recognition 
schemes remain to be devised. 
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1A Voice Storage System: 


Voice Storage in the Network—Perspective 
and History 


By E. NUSSBAUM 


(Manuscript received November 17, 1981) 


In mid-1976, Bell Laboratories undertook development of the then 
radical concept of introducing new customer services via a voice- 
storage capability in the network. With four systems installed and 
ready for service, the project was terminated in October, 1981, as a 
result of Federal Communications Commission (FcC) actions stemming 
from the Computer Inquiry II decisions declaring voice storage to be 
an enhanced service that could not be offered by the regulated network. 

The following five papers, describing the services, architecture, and 
technology of the 1A Voice Storage System (vss) implementation of 
this Custom Calling Services (ccs) II offering were written two years 
ago and held until the regulatory outcome was settled. This brief 
introduction is intended to provide some background and perspective 
on the intervening time period. 

The basic design concepts behind the ccs II offering and its associ- 
ated 1A VSS serving vehicle can be categorized into four major ele- 
ments: 

(t) Provision of a new class of flexible stored program controlled 
customer services involving storage, for later delivery, of the customer’s 
voice messages under either called party control (Call Answering 
services) or calling party control (Advance Calling services)—to be 
known collectively as Custom Calling Services IT. The design of ccs II 
included careful attention to interaction with previously existing ESSs- 
based services (Custom Calling Services I) and to software flexibility 
of feature definitions, as has historically been the case in other stored 
program controlled systems. 
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(tt) For widespread availability and easy deployment, and for min- 
imum costs, a system configuration consisting of shared voice storage 
nodes, each subtending multiple local Ess offices via dedicated trunk 
groups to provide access for customers. Appropriate traffic engineering 
allows for rapid changes to meet changing demand needs and forecasts. 

(wit) Extensive use of digital technology to assure optimum oppor- 
tunity to ride the silicon learning curve, including disk storage for 
digitally encoded speech, stored program control with distributed 
peripheral microprocessors, and the use of coder/decoders (CODECs), 
digital buffering, and custom large-scale integration (LSI) at the inter- 
faces. 

(tv) Emphasis on overall integrated network operation between the 
Ess and 1A vss entities, including a formalized signaling plan, joint 
operation on billing functions and service orders, and the use of a 
common operations support system network. 

Based on these concepts, the 1A vss was designed and programmed 
for providing ccs II as described in the five papers that follow. The 
first system was installed in Philadelphia, Pennsylvania, and the first 
calls were processed there in early 1979. Initial experiments indicated 
that the human factors aspects of the design (customer interaction and 
perception of the services) were generally good and required only very 
minor changes. However, the reliability of the service, in terms of lost 
calls and integrity of the long-term voice message database, was below 
expectations, as was the total throughput capability of the centralized 
vss. During 1979, required software changes were undertaken and 
retested and three additional systems were installed in New York, 
Dallas, and Chicago. In March, 1980, full-time (24 hours a day) 
“friendly user” service was established with the Philadelphia 1A vss, 
starting with 35 Bell of Pennsylvania employees using the service from 
their residence phones, and subsequently progressing to 150 employees. 
Similar friendly user service was also initiated at other sites with the 
largest activity taking place at the Dallas site where friendly users 
numbered 450 people by late 1980. 

As a result of these successful tests, a tariff was filed for the 
Philadelphia offering with the Pennsylvania Public Utility Commission 
(puc) in May, 1980, for planned July, 1980, service. Plans were made 
for filing tariffs for the other sites shortly thereafter. The tariff was 
not approved by the puc because of a pending antitrust suit filed by 
the Associated Telephone Answering Exchanges, Inc. (ATAE) and 
related PUC proceedings. Viability of the service offering was further 
complicated by the uncertainty created by the May, 1980, Fcc order 
under its Second Computer Inquiry findings that enhanced services 
encompassed the area of voice storage, which could therefore not be 
offered as part of the regulated telephone network. 
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By early December, 1980, the Pennsylvania Puc’s Administrative 
Law Judge recommended rejection of the ATAE position, but service 
continued to be deferred because of Fcc rulings. A late December, 
1980, Fcc order specifically reaffirmed the earlier definition of voice 
storage as an enhanced service, but provided AT&T the option of 
requesting waivers under certain circumstances. The Bell System filed 
such a petition for waiver for ccs II in March, 1981. During the 
intervening period, all four systems continued to provide friendly user 
services to Bell System employees, and considerable data on customer 
reaction and system performance continued to be recorded. 

In October, 1981, the Fcc rejected AT&T’s petition for waiver to 
allow the offering of ccs II as part of a regulated network service. At 
that point, the offering was withdrawn and the project terminated. At 
the time of the withdrawal of the service, measured data at the four 
sites and on laboratory support monitoring equipment indicated that 
the system met or exceeded all of its original design requirements on 
customer machine interactions, maintainability, reliability, and 
throughput. 

The following five papers document the architecture of the service 
and of the serving vehicle. The confluence of these customer needs 
with the rapid progress in voice-storage technology will undoubtedly 
result in numerous future applications. 
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I. INTRODUCTION 
1.1 Motivation 

In modern times, distant voice communication in the United States 
has been predominantly by telephone. There are some applications for 
which voice is recorded, transmitted (perhaps by mail), and replayed, 
but this form of “taped letter” is the exception rather than the rule. 

A constraining factor when communicating by telephone is that 
both the calling and called parties must simultaneously be available. 
The probability of a successful call completion is approximately 0.75. 
Unfortunately, the successful completion is often to the desired line 
but not to the desired terminating party and, thus, additional calls are 
required to complete the communication. 

In many cases, the required communication need be only one way. 
When this is true, it would be convenient to leave a voice message on 
the first calling attempt. Such a capability is being introduced with 
the 1A Voice Storage System (vss) and Custom Calling Services (ccs) 
II. 


1.2 Elements of voice communication 


Clearly, when people are together in the same place at the same 
time, they can simply talk. If they are available at the same time, but 
are in different places, they need some sort of transmission path 
between them. This path can be as exotic as a fiber optic cable carrying 
millions of conversations simultaneously in a digital bit stream or it 
can be as simple as a string between two tin cans. 

If people are in the same place at different times, some storage 
media is required. For text, the medium could be paper on a bulletin 
board, a paper note, or a message scrawled on the washroom wall. For 
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storing voice communication, some sort of a sound recording mecha- 
nism is required. 

The 1A vss brings together a unified communication system incor- 
porating the required transmission paths and storage media to permit 
voice communication between individuals who do not coexist in either 
space or time. 


1.3 History 


Early in the history of the telephone, telephone operators served as 
receivers and transmitters of messages for their customers. In many 
small towns, they simply jotted down notes, and periodically tried to 
deliver them whenever they had time available. 

In all but the smallest exchanges, this quickly became an untenable 
situation. In self-defense, the operators quickly reverted back to simply 
making connections. The message storage function had to be provided 
in some other way. 

Eventually, telephone answering bureaus sprang up, and these have 
been available for more than 50 years. Later, an attempt was made to 
mechanize the storage and playback of one-way voice messages 
through customer premises answer-and-record devices. These have 
been provided by the Bell System and several other companies since 
the early 1950s. 

Initially, answer-and-record devices were quite expensive and their 
major use was limited to providing universal announcements which 
could be accessed by many people. They proved to be more economical 
when a single device was used to store the messages of many people 
than when they were used for individual customers. 

The concept of a centralized vehicle to implement voice storage 
features had its origin as a Bell Laboratories research concept during 
the 1940s. On-going research into specific component technologies, 
control architectures, service definitions, and system implementations 
continued until the early 1970s when the required technologies had 
matured sufficiently to enable a cost-effective realization of a vss. 
During 1975, the development of a vss began, with the specific design 
for the 1A vss reaching completion during 1976. The first 1A vss was 
shipped to The Bell Telephone Company of Pennsylvania in August, 
1978, with the expectation that it would be placed in service with ccs 
II features in 1980. 

Many people have viewed the concept of storing and forwarding 
voice messages with almost as much enthusiasm as the concept of 
storing and forwarding text messages via so-called “electronic mail.” 
Just as predictions were made in the 1930s that newspapers would 
soon be delivered by wire, many predictions have been made about 
the utility of delivering stored voice messages by wire. While the 
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technology for providing both of these services has existed for some 
time, the projected costs have always been prohibitive. With this 
thought in mind, the next section will describe some of the components 
which were required to build the 1A vss with the attributes discussed 
above. 


ll. THE 1A VSS IMPLEMENTATION 
2.1 Requirements 


The 1A vss must meet a host of access, cost, maintenance, reliability, 
and capacity requirements. Access to the beginning of any message 
must be provided within seconds. The costs must compare favorably 
with those of customer premises equipment. 

The storage system should be as reliable as the transmission path 
connecting to it. It must provide sufficient capacity to ensure service 
when needed. Typically, the total storage capacity requirements for a 
shared facility are substantially less than for per-customer facilities. 

The system must be engineered, taking into account estimates of 
expected holding time, average storage time, probability of message 
left, and the fact that 25 percent of normal “plain old telephone 
service” (POTS) calls terminate in “busy” or “do not answer.” 

The cost of storage is a function of both message length and storage 
time. While acceptable costs may be achievable through the use of 
analog recording equipment, we have found that access, maintenance, 
reliability, and versatility considerations all tend to favor a digital 
approach. 


2.2 A new node in the stored program control network 


As with many new telecommunication services, the cost of a 1A vss 
must initially be spread over as wide a customer base as possible. This 
implies that a single 1A vss system should provide storage capabilities 
for several switching systems rather than providing the required logic 
and storage in each electronic switching system (Ess) office. 

The connection between stored program control (spc) switching 
offices and the 1A vss is represented schematically in Fig. 1. Note that 
the 1A vss system looks much like an independent office. It receives 
inputs over special voice access trunks, it has its own control processor, 
and it directs traffic in and out of a storage subsystem directly instead 
of through the processor to the storage subsystem. From a control 
point of view, the ESS systems have ready access to the line state of 
customers, whereas the 1A vss knows about the stored messages; 
therefore, control of the services requires close cooperation between 
the software in Ess and in 1A VSss. 

To make css II features simultaneously available to hundreds of 
thousands of Ess customers on the basis of casual (i.e., daily) activation 
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and deactivation of service, the use of dedicated, per-line equipment 
was quickly ruled out in favor of trunk-interfaced equipment. The 
requirement to enable any customer’s rapid access to messages stored 
on his behalf similarly ruled out the use of sequential storage media, 
such as analog or digital tape. Therefore, it is apparent that the storage 
subsystem is a major cost item and as such deserves considerable 
attention. Even though memory costs continue to go down between 20 
and 30 percent per year, the amount of storage required is still large 
enough to keep the memory subsystem costs significant. This problem 
is made worse when reliability requirements force the duplicate storage 
of some messages. 

If an encoding rate between 32 and 64 kb is required, bubble and 
charge-coupled devices (ccDs) become prohibitively expensive. Mov- 
ing-head disks currently appear to provide the most viable solution 
from both economical and performance standpoints. 

The advent of economical high-capacity disk storage has made the 
1A vss economically feasible. The description of the implementation 
of the 1A vss is covered in the companion article “Architecture and 
Physical Design,” by R. G. Cornell and J. V. Smith, in this issue of 
The Bell System Technical Journal. 


2.3 Custom calling services Il 
With the availability of the 1A vss as a flexible node in the spc 
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network, numerous Imaginative and useful services can be provided. 
The initial ccs II services were chosen after extensive study of cus- 
tomer requests and market surveys to determine utility and conveni- 
ance. The ccs II services are described in detail in the companion 
article “New Custom Calling Services,” by D. P. Worrall, also appear- 
ing in this issue. 


2.4 Call answering service 


Answer-and-record services provided centrally can be considerably 
enhanced over those provided solely via customer premises equipment. 
For example, the customer’s phone can be answered on “busy,” as well 
as on “no answer.” During an office busy hour, the number of busys 
and no answers are approximately the same. 

With the use of the remote access options, the 1A vss provides the 
answer-and-record service to customers who are away from their home 
telephone. | 


2.5 Advance calling services 


The ccs II entry into the voice store and forward market is called 
Advance Calling. Advance Calling allows the calling party to record a 
message to be delivered at a future time. The customer may specify a 
future time or have the 1A vss deliver the message immediately. 
Messages may be sent to most telephones which can be dialed directly, 
even to an individual’s phone as a reminder or a wake-up call. 


ill. SUMMARY 


Installation of the 1A vss, as an extension of the transmission and 
switching facilities already provided by the telephone network, leads 
to a new dimension of utility and convenience for the telephone 
customer. For many people it will remove the frustration of trying to 
get through. For others, it will provide convenient access to information 
of special interest. 

The technology is here today. The companion papers in this issue 
define the ccs II features, the implementation of the 1A vss hardware, 
operational software, maintenance software, office engineering, and 
testing considerations. 

The 1A vss will provide a new dimension in voice communications 
which we have just begun to exploit. 
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In creating new customer services, systems engineers traditionally 
conduct market needs analysis, engineering feasibility studies, and 
project economic analysis. Finally, they prepare development require- 
ments. In addition, systems engineering designs the service from the 
customer’s point of view. This article describes new custom calling 
services as the customer sees them and highlights some interesting 
features of the services. 


l. INTRODUCTION 


The introduction of No. 1 Ess in 1965 signaled the beginning of new 
telephone services to the residence and business customer. Some of 
these services, called Custom Calling Services (ccs), were made avail- 
able to customers served by ESs central offices and allowed customers 
to make more efficient use of their telephone service. Custom Calling 
Services include Call Waiting, Call Forwarding, Three-way Calling, 
and Speed Calling. 

Call Waiting, the most popular service, notifies customers of a 
second call when they are already on a call. The customer has the 
option of answering the second call and placing the first call on hold. 
The service has been very popular with customers who use the tele- 
phone frequently. 

Another popular service is Call Forwarding, which allows customers 
to “transfer” all of their incoming telephone calls to another telephone 
number. When the service 1s activated, calls to the customer’s number 
will automatically be forwarded to another number which the customer 
specified when the service was activated. Originally, calls could only 
be forwarded within the local dialing area of the customer, but cus- 
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tomer reaction to that limitation soon caused a change so that calls 
can now be forwarded anywhere. The service is popular with small 
business customers who forward their calls between their business 
locations and their homes. 

Three-way Calling, previously called Conference Calling, allows 
customers to add on a third party to an established telephone call and 
then talk with both parties at the same time in a full three-way 
conversation. Customers involved in community affairs use Three-way 
Calling to help arrange group activities. 

Speed Calling, previously called Abbreviated Dialing, offers cus- 
tomers the capability to dial frequently called numbers with only one 
or two digits. Customers may choose an eight-number list and/or a 30- 
number list. Initially, customers could only change their list of numbers 
via a telephone company service order. This process was time consum- 
ing and expensive and detracted from the value of the service. The 
service has been upgraded to allow customers to change the contents 
of their Speed Calling lists directly from their own telephone. This 
service improvement has increased the value of Speed Calling to 
customers. Customers not only enter frequently called numbers into 
the list but also special numbers, such as fire, police, first aid, and 
poison control, for easier and faster access to emergency services. 

At the end of 1978, ccs services were available to 15 million cus- 
tomers served by Ess central offices. Customers currently subscribe to 
over 4 million ccs services. Since 1965, there has been a rapid growth 
and wide distribution of Ess central offices, allowing marketing orga- 
nizations to more effectively advertise and sell these services to our 
customers. The results of these efforts can easily be seen in Fig. 1. 

But today’s communications-oriented society requires still more 
advanced telecommunications services. Market studies have indicated 
customer interest In a wide variety of new and modern telephone 
related services. Some of this interest stems from the limitations of 
today’s ccs. These services require the physical presence of the cus- 
tomer at a telephone to actually use or obtain value from the services. 
But in today’s very active and mobile society, customers desire control 
over their communications services even when they are not at their 
telephones. 

Several of these service needs, which are the subject of this paper, 
concern the ability of a customer: 

(z) to receive a message from someone when the customer is not 
at home or is too busy to answer the telephone, 

(tz) to send a message to someone when the customer cannot, or 
chooses not to, reach them directly, 

(zzz) to give information to callers, and 

(tv) to control services when the customer is not at home. The 
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Fig. 1—Bell System sales. 


evolution of these “customer needs” into actual telephone services has 
resulted in the addition of Call Answering, Advance Calling, Custom 
Announcement, and Remote Access services to the existing Custom 
Calling Services. 

Call Answering provides telephone answering and message recording 
service for both residence and small business customers. The service 
offers the general capability to answer a call, deliver to the caller a 
customer-recorded greeting, and then record a message from the caller. 
Messages are accumulated and delivered to the customer upon request. 
The service is under control of the customer, may be used from rotary 
dial or dual-tone multifrequency (DTMF) signaling telephones, and 
requires no special equipment on the customer’s premises. 

Advance Calling service adds a new dimension to telephone service. 
Advance Calling offers customers the capability to record a message 
and have it sent to a designated telephone number at a future time. It 
is for use by customers when direct communications is not desirable or 
not possible because lines are busy or the call spans large time-zone 
differences. 

Custom Announcement Service allows customers to record an infor- 
mational announcement, which 1s delivered to anyone who calls the 
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customer’s telephone number. It is for use by small businesses, schools, 
social groups, and churches as an information distribution service. 

Remote Access will allow customers to access and control these new 
services from another telephone when they are not at the telephone 
with the service. 

The chart below summarizes the original and new ccs and divides 
them into terminating and originating categories. 

Each of the new CCs services will be described in more detail in the 
following sections. 


Terminating Services Originating Services 
Call Waiting” | Three-way Calling* 
Call Forwarding* Speed Calling* 

Call Answering’ Advance Calling* 


Custom Announcement! 


ll. SERVICE DEVELOPMENT 


Detailed service specification work began with an in-depth analysis 
of various market studies, which were done by AT&T to identify the 
demand for new services. Combining these market data with an 
understanding of the technology to provide these new services resulted 
in a general outline of a service, with many possible service options. 
Potential customers were interviewed individually and in groups to 
further understand customer needs and interests in the services and 
options. 

When a form of a service already existed, studies were conducted to 
determine their basic capabilities, their strengths, and their weak- 
nesses. Users of these services were questioned to determine their likes 
and dislikes about the services. Human factors studies were done to 
optimize the human interface to the proposed new services. 

All of this information was used to generate service descriptions and 
detailed specifications for service operations. The services were devel- 
oped to offer infrequent users the basic capabilities. Also the services 
contained enough flexibility and options to satisfy a wide variety of 
needs for frequent users. Many unique service capabilities are incor- 
porated which are not available from existing alternatives. 

The following sections describe in detail the new service offerings. 
Since the development of new services is an ever-evolving process, the 
actual services designed may differ slightly from the following descrip- 
tions. Customer reactions to the services will be evaluated and changes 
and enhancements will be designed to meet customer needs. 


* Original Custom Calling Services, now called ccs I. 
* New Custom Calling Services, now called ccs II. 
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Ii, CALL ANSWERING SERVICE 


Call Answering (CA) service is a sophisticated telephone answering 
and message storage service which provides a variety of service capa- 
bilities and options to meet the needs of different users. The service 
answers calls, delivers a customer-recorded greeting to the caller, 
records a message from the caller, and then stores the message for 
later retrieval by the customer. Initially, two forms of CA service will 
be offered. First, a Daily Call Answering (DCA) service is designed for 
customers with an infrequent or occasional need for an answering 
service (e.g., night out, weekends, vacations, etc.). This service is 
simple to access and control and may be activated at any time from 
the customer’s telephone without prior arrangement with the Bell 
operating company (Boc). A maximum number of messages stored is 
established for DCA customers. The service is available to customers 
with rotary dial or DTMF signaling. The customer will be charged for 
the duration of service activation, plus a charge for each call received. 

The second ca service, Monthly Call Answering (MCA) has been 
designed for the user who has a regular need for CA service. It provides 
greater message storage capacity and message handling capability than 
the DCA service. A customer desiring MCA service must subscribe to 
the service by contacting the Boc Business Office and will be charged 
a monthly fee, plus a usage charge based on the number of calls 
received. 

The CA service can be regarded as having four phases: service 
activation; call answering and message recording; message retrieval; 
and service deactivation. Before describing each phase, several terms 
needed to be defined. A greeting is the recording which is delivered to 
the calling party by the CA service. The greeting may be either a 
standard Boc-provided message or one which has been personally 
recorded by the customer. A message is a recording made by a caller 
(in response to a customer greeting), which is stored for later retrieval 
by the customer. Prompting announcements or prompts are a special 
set of CA service announcements that will assist a customer in using 
the service. 


iV. CALL ANSWERING—SERVICE ACTIVATION 


To activate DCA service, a customer dials the service activation code 
(Table I) and a prompt asks the customer to select either the standard 
or personal greeting by dialing either 2 or 7. A customer who chooses 
the personal greeting is instructed to start recording after the record 
tone. If the customer stays on the line after finishing the recording, 
the personal greeting is replayed for the customer’s approval. 

To activate MCA service, a customer dials the service activation code. 
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Table 1—Custom Calling Services Il Access 


Codes 
Codet 

Call Answering 

Activation 3 1151 

Message retrieval 1152 

Deactivation 1153 

Monitor 1154 
Advance Calling 

Message recording 1141 

Status check 1145 
Custom Announcement 

Activation 1158 

Deactivation 1159 
Remote Access Seven-digit telephone number 
Privacy Code Change 1161 


tCustomers with DTMF signalling telephones may use * in- 
stead of the digits 11 (i.e., *51 instead of 1151). 


(Note, the service access codes are the same for DCA and MCA Service.) 
The service identifies the customer as a monthly subscriber. The 
customer then receives prompting instructions to enter a service 
control code. An MCA customer may record a new greeting, replay an 
existing greeting, choose the standard greeting, or choose the number 
of rings before CA answers. An MCA customer may also hang up during 
this prompting announcement, in which case the CA service will be 
activated with the existing greeting that is on file for that customer. 
For both DCA and MCA customers, CA service is activated when the 
customer hangs up. Customers may verify activation of the service by 
dialing their own telephone numbers and having the call answered by 
CA service. A customer wishing to change or modify a greeting may do 
so at any time by simply performing the activation procedure again. 


V. CALL ANSWERING—CALL ANSWER AND MESSAGE RECORDING 


Once CA service has been activated, the service begins to answer 
calls to the customer’s telephone. The most typical call type to be 
answered by CA service is the “no-answer’”’ call. If a DCA customer does 
not answer the call within three ringing cycles (approximately 18 
seconds), the call will be switched from the customer’s line and 
connected to CA service. For MCA, the customer selects the number of 
rings (from 0 to 7) during activation. If 0 rings is selected, the call is 
immediately answered by CA service, regardless of the busy/idle status 
of the customer’s line. 

When a call is answered by CA, a CA service Logo (a short series of 
tone signals uniquely identifying CA service) and the customer’s greet- 
ing are delivered to the caller. The cA service automatically provides 
a record tone at the end of the greeting and switches to a recording 
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mode to record a message from the caller. A message is considered 
received if the service detects the presence of voice signals for a 
duration of at least one-half second. After the message is recorded, it 
is duplicated for reliability, and stored in that customer’s message file 
for later retrieval. Voice signals shorter than one-half second are erased 
and not stored for the customer. This prevents customers from hearing 
disconnect tones and noises from callers who did not wish to leave a 
message. 

A caller has ten seconds from the record tone to begin a message. 
Recording does not start until the service detects the presence of audio 
signals after the record tone. Thus, messages stored for a customer do 
not include the silent interval prior to the first voice signal. Also, the 
trailing silence at the end of a message is erased, so customers need 
not listen to silence during message retrieval. The service stops record- 
ing if 3 seconds of continuous silence is detected or if the maximum 
message length is reached. The service automatically disconnects the 
call at the end of message recording. 

In addition to answering a call when the customer does not, CA 
service provides the unique capability to answer calls when the cus- 
tomer’s line is busy. Furthermore, CA service has the capability to 
handle several calls at the same time. Typically, the basic service will 
answer only two simultaneous calls, but monthly customers have the 
option (at an additional charge) to specify up to 15 simultaneous calls 
to be answered by CA service. Additional calls, exceeding a customer’s 
simultaneous calls limit, would receive normal busy tone. 

Another unique aspect of CA service is that whenever CA service is 
answering a call and/or taking a message, the customer’s line is free to 
handle other calls. The customer may answer other calls or place an 
outgoing call. The line is not tied up while a message is being recorded. 

Because CA has been designed to supplement the existing Custom 
Calling Services, its interactions with Call Waiting (cw) and Call 
Forwarding (cF) have been carefully designed. If a customer has both 
cw and ca, during the activation phase of ca, the Cw service is 
momentarily inhibited so that the cw tone sequence will not be 
recorded into the customer’s greeting. During a period when CA is 
active, Cw and CA interact to maximize the opportunity for the cus- 
tomer to receive a call. When the customer is dialing or the customer’s 
phone is ringing, cw tone cannot be given so the call is answered by CA 
service. When the customer is busy on an established call and another 
call arrives, the normal cw tone is provided. The customer may flash 
the switchhook and answer the second call. If the customer does not 
answer the cw call, normal procedure for CW service is to provide a 
second cw tone 10 seconds after the initial cw tone. If CA service is 
active, the second cw tone is not provided, but at the time the tone 
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would have occurred, the call is answered by ca. This procedure gives 
the customer with Cw and CA service a great deal of flexibility in 
handling incoming calls. 

With the above arrangement for cw and CA service, a customer can 
“answer” at least four calls at the same time. The first call is answered 
directly, the second could be answered by cw, and the third and fourth 
calls could be answered by CA service. 

A customer may have CF service, as well as CA service. In this case, 
both services cannot be active at the same time. One service must be 
deactivated before the other service can be activated. A call can be 
either forwarded to another telephone number or answered by CA 
service but not both. 

After CA service answers a call, MCA customers may screen incoming 
calls by listening to the message which is being left. Customers dial 
the monitor access code (Table I), and CA service bridges the customer 
onto the call which has been answered. The bridge is a one-way bridge, 
allowing the customer to only listen to the message being left. If the 
customer desires, he may be connected directly to the caller by flashing 
the switchhook. The caller is switched off of CA service and is connected 
to the customer. Any message which may have been recorded at that 
time is saved and stored away with all other messages for that 
customer. 

To protect customers and the Boc from abusive use of CA service, 
several constraints are placed on message recording capabilities. These 
constraints include limits on (z) the length of an individual message— 
the service stops recording when the maximum message length for 
that customer has been reached; (zz) the number of messages stored 
for a customer at any one time—when the limit is reached, the service 
automatically turns itself off until some of the messages are retrieved; 
and (iit) the amount of time a message may remain in storage—the 
service will erase any message exceeding this storage limit. Typically, 
the limits are 30-second messages, 30 messages in storage at one time, 
and seven days storage. Each of these limits may be specified for 
individual MCA customers. 

Another way has been provided to establish the limits. The system 
has a capability which will provide customers with a fixed amount of 
storage. The service will answer calls and take messages of any length 
until the storage for that customer is full. The limit on message length 
or number of messages, or both, is thereby removed with this option. 


VI. CALL ANSWERING—MESSAGE RETRIEVAL 


Whenever messages reside in a CA customer’s message file, two 
message waiting signals are given to the customer to notify him of the 
existence of these messages. Whenever a customer goes off-hook to 
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place a call, the initial 2 seconds of dial tone is interrupted (i.e., turned 
on and off rapidly). A customer may dial during the interrupted dial 
tone. The second signal, a short burst (500 ms) of ringing is applied to 
the line whenever a customer disconnects from a completed call. The 
customer may, at his discretion, retrieve messages or ignore either 
signal. 

The message retrieval procedure is basically the same for both DCA 
and MCA customers. The customer dials a retrieval code (Table I) and 
is connected to CA service. The CA service responds to the customer 
with a prompt stating “You have had M calls since you last played 
back your messages and you have N messages waiting,” where M 
equals the number of calls answered by CA and N equals the number 
of messages for the customer. Each message recorded for that customer 
is now delivered in the same order in which the messages were received. 
A unique feature of CA service is that just prior to returning each 
message, CA service automatically tells the customer both the day of 
the week and the time of day when that message was recorded—for 
example, “Thursday—9:15 a.m.” The message 1s then delivered to the 
customer. Messages are separated by a short tone and approximately 
3 seconds of silence. 

Daily CA customers have very basic message handling capabilities 
during retrieval. Messages for a DCA customer are delivered once in 
chronological order. After complete delivery of all messages, a service 
prompt indicates that if the customer continues to listen, the messages 
will be repeated two more times. During any of the message deliveries, 
the DCA customer may “skip” the delivery of a message by dialing the 
digit 4. The message is skipped and the next message 1n the sequence 
is started. Whenever an DCA customer hangs up during retrieval, all 
messages which were delivered completely at least once (or skipped) 
are automatically erased from the message file. All other messages are 
retained. 

Monthly cA customers have complete message handling capabilities 
including save, which saves a message until the next message retrieval 
request; repeat, which immediately begins delivery of the current 
message again; pause, which causes a 5-second hold on message 
delivered; and skip, which causes the current message to be stopped 
and the next message in the sequence to begin. Successive dialing of 
skip or repeat causes MCA Service to sequence ahead one message at a 
time or to sequence backward one message at a time. 

Whenever an MCA customer hangs up from message retrieval, all 
messages delivered completely at least once (or skipped) and not 
specifically saved, are erased from the message file. All other messages 
are retained until the next retrieval request. 

When messages are erased from storage, various information 1s 
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tabulated and stored for billing and service analysis purposes. This 
information includes length of message, duration of storage, date and 
time of message recording and message retrieval, number of messages, 
and the busy/idle status of the customer’s line when call was answered. 
This information will be analyzed in detail to better characterize 
customer answering service needs. 


Vil. CALL ANSWERING—SERVICE DEACTIVATION 


A CA customer may turn off or deactivate the service at any time. 
The procedures are the same for DCA and MCA customers, but the 
action taken by CA service is different. The procedure is for the 
customer to dial the deactivation code (Table I) and listen to a service 
prompt indicating that the service will be turned off. If one or more 
messages are still in storage, they will be delivered if the customer 
continues to listen. Whenever a DCA customer disconnects from the 
deactivation request, all of the customer’s messages, the personal 
greeting, and the service active indicators are erased. Necessary billing 
information, such as duration of service activation and the number of 
calls, is collected and an accounting entry is made to bill that customer 
for the service. 

Whenever an MCA customer disconnects from a deactivation request, 
only the CA service active indicator is removed. The customer’s per- 
sonal greeting and any messages in the customer’s message file are 
retained. The MCA customer will still receive message waiting indica- 
tors until the message file is empty. 


Vill. CALL ANSWERING—SERVICE OPTIONS 


Several CA service options exist that will allow the Boc to customize 
the service parameters of CA service to fit the needs of individual 
customers or groups of customers. Each of the following service param- 
eters may be adjusted to fit the needs of the market place: length of 
personal greeting, maximum length of messages, number of messages 
stored at one time, length of time messages remain in storage, and 
number of simultaneous calls answered for each customer. In addition, 
a Remote Access option can be applied to CA service. Remote Access 
will be described in more detail later. 


IX. ADVANCE CALLING SERVICE 


Advance Calling (Ac) is a message-sending service available to all 
customers served by properly equipped Ess central offices. It offers 
customers the capability to send a recorded message to a designated 
telephone number at a future time. Considerable interaction between 
the customer and the service is required to allow customers to specify 
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the telephone number, to specify the time of delivery, and to actually 
record the message. Extensive studies were done to develop easy and 
logical input procedures for AC. 

The basic service operation is for the casual or infrequent user of 
the service who may not remember the operating procedures. Special 
user prompting announcements guide and assist customers through 
the entire procedure. However, frequent users need not be burdened 
by instructional prompts. For instance, when the service is prompting 
for digits to be entered, either time or telephone number, the reception 
of any digit cancels the prompt and the service begins collecting digits. 
Thus, a customer may “dial thru” prompting announcements. In 
addition, customers with DTMF signaling may dial the digit to cancel 
prompting announcements preceding a message recording operation 
to speed up Service operation. 

Since there is a significant amount of information to be entered into 
AC service, the chances for dialing mistakes is higher than normal. The 
AC service allows certain errors to be corrected. For instance, if the 
service detects an invalid area code or an invalid time of day, the 
service asks the customer to re-enter the information. If the customer 
detects a mistake while dialing, customers with DTMF signaling may 
dial the digit « to signal the service that an error has been made. The 
service then prompts the customer to enter the correct information. 
Furthermore, all information dialed by the customer is repeated to the 
customer for verification. 

The AC service operates in three distinct phases. The first phase, 
called message recording, allows the customer to select the telephone 
number to which the message should be delivered, specify the time for 
delivery (optional), and record the message. The second phase, called 
delivery, is automatically performed by Ac service. It places calls to 
the specified telephone number and upon answer, delivers the cus- 
tomer’s message. The third phase, called delivery status check, allows 
the customer to determine if and when a message has been delivered. 
If a message has not yet been delivered, a customer may cancel any 
future attempts to deliver it. Each phase of Ac service will be discussed 
in more detail. 


X. MESSAGE RECORDING PHASE 


Any residence or small business customer may access AC service by 
dialing the service access code (‘Table I) on a DTMF signaling or rotary 
dial telephone. The customer is greeted with a prompting announce- 
ment indicating that ac service has been reached and the customer 
should dial the telephone number where the message should be deliv- 
ered. The prompt states, “Please dial the telephone number to which 
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you want your message delivered. Dial the entire number exactly as if 
you were dialing it directly.” The customer then enters the telephone 
number, and it is spoken back to the customer for verification. The 
verification response states, “The number to which you want your 
message delivered is (number entered).” If incorrect, the customer 
may reenter the telephone number by following error correction pro- 
cedures. 

The customer is then asked to choose between Customer Specified 
Delivery (csp) or Service Specified Delivery (ssp) by entering the 
appropriate control digit. The announcement states, “If you wish to 
select the time you want your message delivered, please dial the 
number 9. If not, please dial 7.” If csp is selected, the service prompts 
the customer to enter the time of day (hours and minutes and a.m./ 
p.m. indication) at which the first attempt to deliver the message is to 
be made. The service asks the customer to, “Please indicate in your 
local time when you want your message sent. First dial the hour and 
the minutes. Then dial A for a.m., or P for p.m.” (Twenty-four hour 
time notation is not used by the service since tests show a very large 
error rate with that time convention.) The time entered by the cus- 
tomer is spoken back for verification. The announcement states, 
“Thank you. The time you want your message sent is (day of week) 
(time) (a.m., p.m., noon, or midnight)” for example, “Thursday—3:15 
p.m.” Customers may only enter delivery attempt time within 24 hours 
of the current time of day. If a customer does not wish to specify the 
time (i.e., chooses SSD), the first attempt to deliver the message is 
made 15 minutes after the message is entered, but no attempts are 
made during the late night hours. 

The customer is then prompted to record the message. The service 
prompt states, “When you hear the tone please record your message. 
You will have (number) minutes of recording time. After recording 
your message, please dial 2 to approve it for delivery.” The actual 
maximum length of a message is a service parameter to be adjusted as 
experience is gained with the service. Initially, a 1-minute message is 
planned for Ac service. The message must exceed a 1/2-second mini- 
mum message length, and the recording will end if the maximum 
message length is reached or the customer remains silent for 3 seconds 
(whichever occurs first). If the customer dials 3, the message is then 
played back for verification. The service prompt states, “(tone) Thank 
you. Your message will now be played back for your approval. After 
hearing your message, you may approve it for delivery by dialing the 
number 2. If you want to re-record your message, dial the number 6 
(tone).” The playback of the recorded message begins within 1 second. 
When the customer dials the number 2, the message is scheduled for 
delivery. 
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Whenever a customer makes a mistake detectable by the system, an 
appropriate prompting announcement informs the customer of the 
mistake made and voice-responds any information entered. For in- 
stance, if a customer entered only six digits of a telephone number, the 
service would respond with, “We’re sorry. You have not dialed enough 
digits. The numbers you dialed are (numbers).”’ 

The service duplicates the recorded message for reliability, schedules 
the delivery of that message at the appropriate time, and records 
necessary information for future billing purposes. 


XI. DELIVERY PHASE 


If the customer specified the delivery time, (Customer Specified 
Delivery, cSD), the first attempt to deliver that message is made at 
that time. If the customer selected System Specified Delivery (ssp), 
the service will schedule the first attempt 15 minutes after the message 
is entered, unless the time falls into a nondelivery time period. Selected 
hours during the day may be specified as nondelivery periods for SSD 
messages. Typically included are the late night hours (10 p.m. to 7 
a.m.) in the time zone where the message will be delivered. In addition, 
retry attempts for SSD messages will not be made during central office 
busy hours where the call originates. For both Csp and ssp, no delivery 
attempts will be made during an administrative period lasting 10 
minutes at midnight. 

The tentative time for the first delivery attempt, t), is given by: 


| T + 15 min (—0, +15 min) SSD 
ty = ’ 


| (he (—0, +5 min) CSD 
where 


T = time of message entry into AC 

T' = customer-specified delivery time. 

As referred to above, ¢; is only a tentative first delivery time because 
it may be necessary to adjust ¢,; to avoid nondelivery periods. The 
exact time for the first delivery, 7), is determined from ¢, by the 
algorithm in Fig. 2. 

At the time of delivery, the service places a call, over the regular 
telephone network, to the telephone number specified by the customer. 
Voice-presence circuits will “listen” to the call progress tones associ- 
ated with the call attempt, and the service will be able to determine 
the status of the call. It will know that a busy or reorder condition was 
reached or that the called number is ringing. Any other call condition 
that might be reached is treated as a ringing connection, If the call 
attempt reaches busy, the attempt ends and a record is made of the 
busy condition. If a ringing connection is reached, the service will wait 
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Fig. 2—Delivery time algorithm (find 7, from f,). 
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11:55PM St, 
< 12:05 AM 
? 


for approximately 30 seconds (five ringing cycles) for answer before 
abandoning the attempt and records the no-answer call condition. The 
time of the next attempt is then calculated. 

The algorithm for determining the next attempt time considers how 
many attempts have already been made, whether the previous attempt 
reached busy or a no-answer condition, and whether CSD or SSD was 
selected. In general, the time between attempts gets longer as the 
number of attempts increases. Advance calling will make from six to 
ten attempts in no more than 13 hours for csD and in no more than 24 
hours for ssp. It was felt that if a customer specified the time of 
delivery, the message was important and, therefore, delivery should be 
attempted more frequently. 

To determine the tentative time of the next delivery attempt, f¢,, 
when n # 1 
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Li = Th-1 + Ath, 


where Aé, is found in the appropriate Delivery Time Table in Tables 
II and III. Based on the call condition reached on the previous attempt 
and whether a CSD or SSD message is being processed, select from the 
Delivery Time Table the appropriate time interval and add it to T;,-1. 
The Delivery Time Tables also include a range adjustment factor to 
allow the service to smooth out traffic loads over time periods to 
prevent peaking of call attempts. 

In addition to the time-table calculations for ¢,, several other con- 
ditions must be evaluated: 

(tz) The total number of attempts, N, for any message cannot 
exceed 10. 

(ii) If attempt n encounters a busy after attempt n — 1 encountered 
a no answer, set n = 2. This allows for a faster retry on the next 
attempt to increase the probability of delivery since it appears that 
the customer is now home but busy on the telephone. 

(iii) When the above adjustment has been made, but the next call 
attempt encounters a no-answer condition, set n = N — 1, allowing the 
service to extend the time of the next attempt since it appears that the 
customer is really not available. 

(tv) If there have been three consecutive no-answer conditions and 
the last attempt occurred between 11 a.m. and 2 p.m., add an additional 
3 hours to t, calculated from the table. This allows the service to move 
several call attempts into the evening hours to maximize the probabil- 


Table II—Delivery Time Table—CSD 
t, if attempt n — 1 reached 


(minutes) 
n Busy No Answer 
2 5 (—0,+5) 30 (—0,+15) 
3 10 (—0,+5) 60 (—15,+15) 
4 15 (—5,+5) 60 (—15,+15) 
5 20 (—5,+10) 90 (—15,+30) 
6 25 (—5,+10) 120 (—15,+45) 


Table III—Delivery Time Table—SSD 
t, if attempt nm — 1 reached 


(minutes) 
n Busy No Answer 
2 15 (—0,+15) 60 (—15,+30) 
3 30 (~15,+15) 120 (—15,+30) 
4 60 (~—15,+15) 180 (—15,+45) 
5 75 (—15,+30) 180 (—15,+45) 
6 90 (—15,+30) 240 (—15,+60) 
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ity of delivery to customers who work during the day and are home 
during the evening. 

Once ¢, 1s determined from the Delivery Time Tables and the 
additional constraints above, T;, is calculated from ¢, by the algorithm 
in Fig. 2. 

The generation of the delivery algorithm for ac was based on 
evaluations of network completion studies and studies of calling habits 
of telephone customers. But at best, the delivery algorithm is a “best 
guess.” As AC service is used by customers, detailed data on call 
attempts, time of day, and busy/idle conditions reached will be re- 
corded and analyzed. Results of that analysis will be used to modify 
the delivery algorithm to maximize the probability of delivery with the 
fewest number of attempts. 

When the called telephone is answered on any delivery attempt, the 
answering party is greeted by a service Logo and a standard announce- 
ment identifying the call as Ac service and explaining that a recorded 
message will be delivered if the customer will continue to listen. The 
announcement states, “This is the Bell System Advance Calling Ser- 
vice with a personal message recorded earlier for delivery to you. If 
you stay on the line, your message will be played three times.” The 
message is then delivered for the customer. 

The message is considered delivered if the customer hangs up during 
the introductory greeting or the delivery of the message. No further 
attempts will be made. The service will record, for billing purposes, 
the length of the call made to deliver the message. 


XIl. DELIVERY STATUS CHECK 


A unique aspect of AC service, is that customers may check on the 
delivery status of their messages up to 48 hours after the message is 
entered into the system. A customer may check the status of a message 
or messages by dialing a Delivery Status Check access code (Table I) 
from the same telephone where the message was originated. If a 
message or several messages exist for-a single destination, the status of 
each message will be returned with an indicator of the day and time 
each message was entered into the system (1e., “your message of 
Thursday 3:15 p.m. was...). If messages have been recorded for deliv- 
ery to several telephone numbers, the customer is asked to enter the 
telephone number of the message for which the status is desired. The 
appropriate status is then returned. 

For each message status requested, one of five possible responses is 
given: 

(tz) The message could not be delivered (the service has stopped 
all further attempts); 
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(it) The message has not been delivered, but further attempts are 
continuing; 

(iit) The delivery call was completed, but the called party hung up 
before the message was delivered; 

(tv) The message was partially delivered when the called party 
hung up; or 

(v) The entire message was delivered. 
When a delivery call is completed, the day and time of the completion 
is included in the status report. 

Another feature of ac is that when a customer receives a status 
report that a message has not yet been delivered, the customer may 
cancel any future delivery attempts for that message by dialing the 
cancel code, 0. The service then verifies the cancellation of a message. 
The status of messages remains available to customers until a delivered 
or undelivered status is reported to the customer or until 48 hours 
have elapsed since the message was recorded. The actual message is 
erased when delivery is made or when no further attempts to deliver 
are to be made. 


XIli. BILLING 


There are two parts of the billing for ac service. Customers are 
charged for entering a message into the service, regardless of the 
outcome of the delivery attempts. This message recording charge 
covers the cost of recording, all delivery attempts, and one status 
check. Additional status checks are charged on a per-call basis. Cus- 
tomers are also billed at standard call rates in effect at the time of day 
the message is delivered for any local message unit charges or toll 
charges associated with any delivery attempt that is completed. 


XIV. CUSTOM ANNOUNCEMENT SERVICE 


Custom Announcement Service (CAS) will provide customers, mainly 
small businesses and community organizations, with a low traffic 
announcement service. The service operations are identical to CA 
service, except that callers cannot leave messages and the customer’s 
telephone is never rung while the service is active. Customers may 
activate or deactivate the service at any time by dialing the proper 
access codes (T'able I). During activation, a customer records the 
announcement desired. 

The cas offers the customer the advantage of having the telephone 
completely free for making outgoing calls. It also allows customers to 
specify how many simultaneous calls would be answered by cAs. When 
CAS 1s combined with Remote Access (to be described next), a customer 
may set up a recorded announcement service on a telephone number 
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without actually equipping that number with a telephone. The service 
would be controlled from any other telephone using remote access 
procedures. 


XV. REMOTE ACCESS 


The above description of CA, AC, and CAs has assumed that the 
customers only access the service from their homes or business tele- 
phones. But there is an identified need for customers to be able to 
access their services when they are away from their regular telephones. 
A Remote Access (RA) feature is available for CA, AC, and CAS, which 
will permit customers to access their services from any DTMF signaling 
telephone. 

Each customer who selects the RA feature must use a customer- 
changeable, variable-length (maximum nine digits) Privacy Code (Pc) 
so that the service can identify the customer. In addition, customers 
are given a special seven-digit telephone number which they can call 
when RA is desired. All customers in the same central office code (the 
first three digits of a telephone number) will use the same RA telephone 
number. 

When a customer wants to access a service from a telephone other 
than the telephone on which the service is active, a call must be placed 
to the special RA telephone number (plus area code if needed). This 
call will be a charged call with the appropriate local message unit or 
toll charges applied. Upon terminating to the service, customers will 
be prompted to enter their home or business telephone number and 
their pcs. This must be done using DTMF signaling. The service will 
check the telephone number and the Pc entered. If a match exists, the 
customer is prompted to enter the access code for any service desired 
(see Table I). 

If a match does not exist between the telephone number and the Pc, 
the caller is given a second try. If the second try fails, the caller is 
disconnected from the service. To deter attempts to “break” a Pc (over 
one billion combinations possible) and gain unauthorized access to a 
customer’s service, records are kept on the number of error attempts 
made over some time interval. When a preset threshold is reached, RA 
capability is turned off for that telephone number for some period of 
time and is then turned on again. 

Once customers properly complete the Pc check, they enter service 
access codes and have complete control over their services just as if 
they were at their base telephones. The only restriction imposed is 
that customers must use DTMF signaling to control the services via RA. 
A customer may access CA service to turn it on or off, retrieve messages, 
or change a personal greeting; AC service to send a message, check on 
the status of a message or cancel a message; or CAS to turn it on or off 
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to record a new announcement. Note, that a customer must subscribe 
to RA on a monthly basis, but via RA, a customer could access and use 
DCA service. 

When an RA customer first subscribes to the service, the BOC assigns 
a null pc. This permits the customer to gain initial access to the 
service. When customers pass the Pc check using the null Pc, they can 
dial the Privacy Code Change (Pcc) access code (Table I) and input 
any new Pc desired. By using the Pcc feature regularly, customers can 
repeatedly change their pcs. Further, no Boc records are made of 
customer’s PCs. 

For all services, RA may be assigned to a telephone number without 
actually having a telephone equipped on that line. This is very attrac- 
tive for AC since it allows customers in areas where AC is not directly 
available to send AC messages via RA. To allow customers to make 
efficient use of RA for AC service, an additional “recycle” capability 
was designed. At the end of the message recording phase, when a 
customer would normally have been disconnected, provisions have 
been made to allow customers to enter the AC access code again to 
“recycle” back to the beginning of the message recording sequence so 
that a second message could be sent via the same RA call. Recycle is 
also available during other phases of AC service. Thus, with the recycle 
capability a customer could, on a single RA call, send several Ac 
messages, check the status of previous messages, cancel previous AC 
messages if not yet delivered, and retrieve CA service messages from 
storage. 

As experience is gained with customers using RA, the remote access 
procedures described here may be extended to other CCs services, e.g., 
Call Forwarding. 


XVI. THE FUTURE 


The new services described in this paper offer the telephone cus- 
tomers new and unique communications services. They allow cus- 
tomers to make more efficient use of telecommunications. These initial 
services establish a solid foundation on which to build future improve- 
ments and additional capabilities. The extent to which these services 
are enhanced will be directly determined by the customers themselves. 
Many enhancements have already been studied and defined for cA and 
AC services. The development only awaits customer feedback from the 
initial services. 

One thing is certain for the future. The technology used to provide 
these new services offers potential for other services that will have an 
impact on the way people will communicate with each other. 
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Architecture and Physical Design 
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To provide new basic capabilities associated with the Custom 
Calling Services II (ccs ID) feature package, a new system has been 
developed. This system, the 1A Voice Storage System (1A VSS), is a 
generalized resource with the ability to receive, store, administer, 
compose, and deliver voice messages or announcements. The capacity 
of this system is such that tt can provide ccs II services on hundreds 
of simultaneous calls and serve tens of thousands of concurrent 
subscribers. In this article, the circuit architecture and physical 
design of the 1A vss is described. 


1. INTRODUCTION 
1.1 Background 


Custom Calling Services now available to electronic switching sys- 
tem (ESS) customers have been implemented through software features 
and the existing switching periphery associated with the Ess system 
itself. However, the new Custom Calling Services II (ccs II) feature 
package for No. 1/1A Ess customers has created the need for a new 
functional capability not available in the existing plant: high-capacity, 
high-availability, and rapid-access voice storage. Additionally, the 
complexity and real-time processing demands of ccs II creates the 
need for auxiliary processor support for the implementation of these 
services. In response to these needs, a new node in the Stored Program 
Control (spc) network has been developed: the 1A Voice Storage 
System (1A vss). The 1A vss is a generalized resource with the ability 
to receive, store, administer, compose, and deliver voice messages or 
announcements. The capacity of the 1A vss is such that it may be 
deployed in a centralized fashion, each system serving tens of ESS 
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Central Offices, hundreds of simultaneous calls, tens of thousands of 
concurrent subscribers to ccs II features, and a total community (via 
subtending Ess offices) of hundreds of thousands of lines. 


ll. SYSTEM ARCHITECTURE 
2.1 Basic structural concepts 


The basic attributes of the 1A vss are that it is a self-contained 
stored-program-controlled entity in the spc network, that it intercon- 
nects with subtending No. 1 Esss via trunks, and that it provides 
random access from these trunks to a massive voice storage medium. 
Special new services utilizing these resources are implemented by Src 
within the 1A vss. It follows that 1A vss embodies many of the 
structural and operational attributes of an Ess. The basic components 
of the 1A vss architecture (Fig. 1) are a switching network, highly 
specialized trunk circuits called voice access circuits (VACs), an ensem- 
ble of high-capacity storage devices and their controllers, a central 
processor and program/data store, a peripheral controller, and an 
ensemble of service circuits and data set controllers. 

Although 1A vss interfaces to No. 1 Ess via analog trunks, all 
internal switching and storage of voice is implemented digitally. To 
support this, each VAC is equipped with a Coder/Decoder CODEC which 
transforms analog voice to/from a digital representation. The digital 
bitstreams associated with encoded voice are switched via the digital 
network to storage media controllers, and ultimately to the media 
itself. Voice storage is implemented digitally via high-capacity moving- 
head disk transports. Moving-head disk transports provide high-speed 
random access to stored messages such that the delays experienced by 
humans after requesting message playback are barely perceivable. 

A major objective associated with the 1A vss development has been 
to minimally impact the design of the Ess offices to which the 1A vss 
interconnects. The 1A vss and each subtending Ess must intercom- 
municate frequently to coordinate their efforts toward the implemen- 
tation of service. Rather than defining the need for a new data link to 
be established in the Ess to effect this communication, interoffice 
signaling is implemented via multifrequency (MF) tone packets. The 
switching network (Fig. 1) is used to establish a path between the 
relevant trunk and one of the 1A VSS service circuits. Once the 
interoffice signaling phase of a call is complete, the network is used to 
reswitch the trunk to an appropriate storage controller. 

Because the customer may control [via dial pulse or dual tone multi- 
frequency (DTMF) signaling] the operation of 1A vss at any time during 
a call, a DTMF/dial pulse receiver is permanently assigned to each VAC. 
This implies a tremendous scanning and control load associated with 
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Fig. 1—No. 1A vss architecture. 
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the 1A vss periphery. Much of the burden of peripheral control has 
been implemented via microprocessors distributed throughout the 
periphery itself. Each storage controller is microprocessor controlled, 
as is the peripheral controller. Each of these processors communicates 
with the central control via a functional command language which 
minimizes both the frequency of communication and the amount of 
central control real time devoted to peripheral control. This provides 
the central control with the additional real time required to implement 
the complex ccs II services. 

The 1A vss exists within the spc network as a self-contained office. 
As such it has been designed to meet the reliability standards estab- 
lished for Ess offices. These standards include reliability and maintain- 
ability features in both hardware and software that ultimately limit 
system downtime to mere minutes per year per system. Specific 
hardware features include duplication of critical units, each imple- 
mented with self-check logic design to enable fault detection. The spc 
of 1A vss is the Auxiliary 3A Processor which was originally developed 
for the No. 3 Ess and No. 2B Ess systems. All signal paths experience 
an automatic loop-around test immediately before they are switched 
into a voice path. All programs and data stored on processor memory 
or disk are duplicated. Voice messages are also duplicated. 

At the system level, 1A vss incorporates data links to the Switching 
Control Center (scc) for remote fault monitoring and control, to the 
automatic message accounting recording center (AMARC) for automatic 
message billing features, and to Engineering and Administrative Data 
Acquisition System (EADAS) for traffic analysis purposes. The 1A vss 
trunks are automatically diagnosed via the centralized automatic re- 
porting on trunks/remote-office test line (CAROT/ROTL) system. 


2.2 The voice access circuit 


Each trunk appearance at the 1A vss terminates a vac. The vac 
(Fig. 2) consists of a trunk circuit, a DTMF and dial-pulse receiver, a 
volce-presence detector, an automatic gain control (AGc) circuit, an 
analog to/from digital CODEC, and a high-capacity, two-port FIFO 
buffer. Each function of the VAc 1s under the control and surveillance 
of the central processor via a microprocessor-based peripheral con- 
troller (pc). The vacs and pc communicate via a synchronous, bit- 
serial message protocol through transmission over an internal distri- 
bution network. 

The 1A vss vacs utilize standard Type II E&M trunk circuitry and 
are available in 2- and 4-wire versions. 

An AGC circuit is included to provide for constant playback sound 
level for both prerecorded announcements and prompts and for mes- 
sages recorded directly from customers. The AGC circuit also meets a 
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Fig. 2—No. 1A vss voice access circuit. 


transmission requirement that all messages be played back at —20 vu 
(measured at the vac trunk port) to assure that customers hear 1A 
vss playback at a pleasing level, regardless of variations in level at the 
time of message recording. The AGc circuit 1s under program control 
in that its function may be defeated at the command of the Pc. In the 
disabled mode, the aGc circuit provides a fixed gain, enabling trunk 
maintenance testing. 

At the time of the message’s recording, the speaker may leave 
periods of silence preceding and following his message. These periods 
of leading and trailing silence are detected and removed by 1A vss to 
provide for a pleasing and efficient playback. The voice-presence 
detector in the vac seeks to distinguish between signals with the 
characteristics of voice and signals with the characteristics of noise. 
Additionally, the voice-presence detector reports “voice present” for 
continuous signals at high levels to allow for the storage of frequency 
shift keying (FSK) encoded data. The voice-presence detector functions 
by observing the rate of change of the envelope surrounding the 
incoming analog waveform. A nonvarying or slowly varying envelope 
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indicates steady-state background noise or line noise. A rapidly varying 
envelope indicates voice. The output of the voice-presence detector is 
used for the deletion of leading and training silence from messages as 
will be described later. 

Because ccs II customers may signal vss for control purposes by 
dialing digits at any time during a call, a DTMF and dial pulse signaling 
receiver is permanently assigned to each vac. To overcome the cost 
penalties of a per-trunk assignment, a new low-cost integrated circuit 
DTMF and dial pulse receiver has been developed for 1A vss. The 
receiver is based on charged-couple device (ccp) filter technology and 
eliminates the need for bulky Rc networks. This new receiver performs 
at a level of quality equivalent to that of standard central office 
receivers in approximately an order of magnitude less space and at 
significantly lower cost. The logic circuitry associated with this new 
receiver also receives as an input the incoming E-lead signal over 
which dial pulses from the subtending Ess are repeated. The receiver, 
upon detecting E-lead activity begins to count dial pulses. Once a valid 
dial pulse digit is received, the receiver reports the digit as though it 
were a DTMF digit. Thus, the vac, Pc, and central processor are 
insulated from the distinction between customer DTMF signaling or 
dial pulse control. The receiver queues two received digits until the Pc 
interrogates it in order to lessen the scanning load on the pc. The 
receiver also provides an “early detect” output signal when it begins to 
observe a DTMF or dial pulse digit as an aid to the record/playback 
control over messages, as will be described later in greater detail. Once 
a digit is fully verified, a second “digit present” output is raised to 
inform the Pc that the receiver should be read. 

The VAC receives its control via a bit-synchronous serial data trans- 
ceiver. Each transmission by the Pc consists of a 27-bit word, contain- 
ing 11 address and parity bits and 16 command and status bits. Each 
transmission to the vac is followed by a return transmission to the 
peripheral controller. The bits associated with commands to the vAc 
are retransmitted back to the peripheral controller in the follow-on 
reply to allow for transmission error checking by the peripheral con- 
troller. 

Voice signals to be stored are encoded digitally into a bit serial 
format at 32 kb/second by an Adaptive Delta Modulation (Apm) 
CODEC. This CODEC, originally designed for the Subscriber Loop Car- 
rier-40 system, is driven at its level of optimal performance by the acc 
circuit which precedes it. Extensive subjective testing has shown that 
this ADM CODEC with acc performs at a level of quality comparable 
with that specified as the Bell System standard for transmission. Upon 
recording, digitally encoded voice is loaded into a two-port F1Fro buffer 
at the CODEC bit rate. When a sufficient number of bits have been 
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stored to fill the message storage portion of a disk track (155,078 bits) 
the disk’s controller establishes a path through the network between 
the buffer and a disk transport and transfers the track’s worth of bits 
at the rate of approximately 10 Mb/second. The buffer must be 
serviced only once every 4.85 seconds, and hence the disk’s controller 
must be connected to the buffer relatively infrequently. This frees the 
disk’s controller to simultaneously serve the needs of many buffers. 

The buffer circuit organizes incoming digitally encoded voice into 
1024-bit blocks. Each block consists of 1007 bits of digitized voice, 16 
bits generated by the buffer’s cyclic redundancy (cRc) generator for 
the purpose of error detection over the block, and one “‘voice presence 
in this block” bit as received from the voice-presence detector. The 
CRC character is checked by the disk’s controller upon transmission to 
the disk as an error check over the switching path. Upon playback, the 
disk’s controller again checks the crc characters to verify the integrity 
of the disk medium. As the message is sent from the buffer to the 
CODEC, the cRc characters are again checked and stripped out of the 
bitstream. This again checks the integrity of the transmission from 
disk controller to buffer, plus the buffer’s memory. 

Once the buffer is told by the system to begin recording a message 
sent from the CODEC, it will actually enter a “pseudo record” mode 
until the voice-presence detector indicates “voice present.” In this 
mode, it records bits but discards them after approximately 120 ms. 
When the voice-presence detector indicates the presence of voice, the 
buffer ceases to discard bits and records continuously. This feature 
eliminates “clipping” of the message potentially introduced by the 
operational delay of the voice-presence detector. 

In the event that a customer should seek to control 1A vss through 
DTMF or dial pulse signaling while recording or playback is in progress, 
the DTMF/dial pulse signaling receiver sends a “freeze”? command to 
the buffer. This prevents the customer from losing any of his message 
while signaling. The receiver’s “early detect’? output is sent to the 
buffer and, when active, causes the buffer to suspend its activity. If a 
“digit present” signal does not follow, the buffer is immediately 
“thawed.” If a digit is detected, the Pc and central processor may elect 
to force the buffer into the “thawed” mode if appropriate (the cus- 
tomer’s digit may, however, be a request for some other action on the 
part of 1A vss). 

Because of return loss in the trunk circuit and because of transmis- 
sion echos, a recorded DTMF digit may be fed back into the receiver 
and be perceived as a customer-initiated signal. To prevent this, the 
“early detect” signal freezes the buffer, while the receiver continues to 
listen for DTMF signals. If a DTMF signal is detected while the buffer is 
“frozen,” the signal must have originated from the incoming line and 
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not from the message being played back. If the signal originates from 
the recorded message, the receiver suspends its operation for several 
hundred milliseconds to lessen the frequency of interruption of the 
recorded DTMF signals. 

All vac circuitry, including the buffer, is implemented on two 8- by 
13-inch circuit packs. Thus, the plug-in VAc circuit pack represents the 
growth circuitry on an engineering basis for expanding offices. 


2.3 The storage media controller 


The functional responsibilities of the disk’s controller clearly go 
beyond that of a computer disk controller. This device, named the 
Storage Media Controller (smc), exerts control over the disk media . 
attached thereto, over buffer circuits, and over the electronic switching 
arrangement that interconnects buffers and sMcs. The SMC is a micro- 
processor-based controller endowed with considerable intelligence and 
autonomy. The SMCs communication with the central processor is at 
a high functional level designed to minimize the frequency of com- 
munication with central control and to minimize the real time invested 
by central control over the detailed operation of its periphery. In this 
realm, the smc might receive a work order from the central processor 
requesting that it play the message beginning at physical location x of 
its disk media to customer y, who is connected to vAc z. The smc then 
accesses the data associated with that message, performs security 
checks to assure that proper data are being accessed, verifies the 
integrity of the signal path to the required vac, and begins to transfer 
digitally encoded voice to that vac. This digitally encoded voice is 
physically realized as many segments of data (see Fig. 3), each segment 
consisting of 4.85 seconds of recorded voice and 256 bytes of control 
data. These data include the identity of the customer to whom the 
message belongs, forward and backward linked list pointers to the next 
and previous segment of the message, and miscellaneous data regarding 
the status of the segment. The smc uses the linked list pointers to 
manage the playback of an entire message autonomously, and reports 
back to the central processor when it detects “end of message.” 

A similar function is executed upon message recording, whereby the 
central processor furnishes a set of disk storage locations that is 
available for recording purposes. The SMc manages the building of the 
data portion of each segment as it is recorded. 

The smc performs somewhat differently in order to compose 
prompts or system announcements. Digitally encoded voice fragments, 
each consisting of 63 ms of sound, are “stitched” together by the smc 
to create complete phrases. The central processor sends the smc an 
ordered set of pointers to many fragments that, taken together, forms 
a particular phrase. The smc then seizes the appropriate VAC circuit, 


848 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


HEADER DIGITIZED VOICE SECTOR eee ee 
eSB UIEY a RIES! OF VOICE STORAGE 


SEGMENT 





NUMBER (DN), NUMBER OF 


INCLUDES: J 
POINTER TO NEXT MESSAGE 
voice | VOICE VOICE VOICE 
BLOCK | BLOCK | BLOCK SECTOR 
BLOCKS OF TRAILING 
SILENCE, INDICATION THAT 


SEGMENT, POINTER TO LAST 
MESSAGE SEGMENT, 

CUSTOMER'S DIRECTORY 

THIS IS A VOICE TRACK, (1024 aT (1024 a 2 * s+ (1024 BITS) 
INDICATION OF ACTIVITY 

STATE OF THIS TRACK 









VOICE 
PRESENCE Broce 
(1 BIT) 


DIGITIZED VOICE CRC 


(1007 BITS) (16 BITS) 





VOICE IS PARTITIONED INTO SEGMENTS, EACH CONTAINING 155,078 BITS (4.85 SECONDS) OF 
THE TOTAL MESSAGE. EACH SEGMENT IS STORED SEPARATELY ON THE DISK MEDIUM ALONG 
WITH A DATA HEADER CONSISTING OF A DESCRIPTION OF THE MESSAGE’S SEGMENT POSITION, 
THE MESSAGE OWNER, THE TYPE OF MESSAGE, AND THE STATUS OF THE MESSAGE. 


Fig. 3—No. 1A vss disk data format. 


accesses the fragments from among its disk population, and stores the 
fragments contiguously in the vaAcs buffer circuit. The short duration 
of the speech fragment, taken together with the vast number of 
fragments that can be stored in the smcs disk population enable an 
automatic message composition capability of exceptional quality, var- 
lability, and scope. For example, all prompts could be duplicated in 
several languages, with the target language selected independently for 
the requirements of a particular call. 

The smc executes trailing silence deletion from recorded messages 
as a post-processing operation. After a message has been completely 
recorded, the SMC is instructed by the central processor to autono- 
mously begin with the last segment of a message, inspect it via the 
stored “voice-presence” bits for the presence of voice, and to delete 
segments that contain no voice. When the last segment containing 
voice is identified, its data field is updated to reflect this fact, and to 
indicate the exact location within the segment of the “end of voice.” 
Upon message playback, the smc will transmit all bits in segments 
until the last segment is reached. At that time, only those bits preced- 
ing end of voice are transferred to the vac buffer. 

The smc has the capacity to accept work lists from the central 
processor to conduct up to 80 concurrent record, playback, or compose 
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operations. To implement this, the smc scans the number of vAcs 
assigned to it at any point in time and schedules disk data transfers for 
those buffers currently in need of service. The vAcs that are assigned 
to a particular SMC at any point in time is a function of system load 
only. These vAcs may be any subset of the total vAc population, and 
the members of this subset change dynamically with time. For exam- 
ple, if a customer is hearing the playback of six messages in sequence, 
each of these six messages may have been sent to the VAC associated 
with that customer by six different smcs. Thus, that vAc moves from 
the work list of one smc to the next as playback proceeds. 

The order in which the SMC serves VACs is determined by the 
locations on disk of the stored segments of data associated with those 
vacs. The smc schedules data transfers according to an algorithm 
which minimizes disk head travel and, thus, maximizes the number of 
transfers that can be conducted in a given period of time. 

In addition to managing the transfer of digitally encoded voice to 
and from vACs, the SMC serves as a traditional disk controller to serve 
the central processor’s bulk data needs. Data are stored on the same 
disk media used for voice messages, with special track header fields 
used to distinguish between locations used for data and locations used 
for voice. The amount of storage space allocated for voice and for data 
changes dynamically as a function of system operation and load. 

Messages are duplicated to ensure message reliability in the event 
of a disk transport failure. A message is duplicated after it has been 
completely recorded and all silence-deletion post processing is com- 
plete. A message is always duplicated onto a disk transport associated 
with an smc different from that of the original recording. Message 
duplication is initiated by the central processor causing the initial smc 
to seize an idle vAc and to move the first segment of the message to 
that vac’s buffer. A second sMc is then directed to read the vacs buffer 
as though it were an incoming message. This process is repeated until 
the complete message is duplicated. Because the buffer is filled and 
emptied at the high bit rate associated with buffer to/from smc 
transmission, the duplication process transpires at a fraction of the 
time that was required for the initial message recording. 

Each SMC equipped in the 1A vss may control up to eight disk 
transports. The fully equipped 1A vss has eight SMCs associated with 
it. The disk transports will be described in detail later in this article. 


2.4 Switching within 1A VSS 


There are two electronic space division switching entities associated 
with the 1A vss as shown in Fig. 4. The first is a nonblocking Time- 
Multiplexed Space Division Switch (TMspDs) which serves to intercon- 
nect VACs with smcs. The second is an engineered blocking electronic 
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Fig. 4—No. 1A vss network implemented. 


space division switch, called the Service Circuit Access Matrix (SCAM), 
which serves to interconnect VACs with service circuits and test circuits. 

The TMSDs is under the combined control of the ensemble of SMCs. 
Each smc utilizes the TMSDS as a time-multiplexed switch with regard 
to data transfer to/from up to 80 VAcs on its work list. Each smc 
continuously and sequentially steps through paths to those vacs to 
interrogate the readiness of each VAc’s buffer to conduct a transfer of 
digitally encoded voice. When a buffer is in a state of readiness, the 
SMC schedules the transfer according to the current location of the disc 
track to be used, relative to the current position of the disk’s moving 
head carriage. When the disk is ready for data transfer, the SMc uses 
the TMSDs to establish a path to the buffer and transfers 157,696 bits 
(including encoded voice, check bits, and voice-presence bits) in ap- 
proximately 17 ms. Completing this transfer, the SMC progresses to its 
next active VAC to effect a similar transfer. 

The TMSDsS is utilized simultaneously in a manner as described above 
by aS many as eight smcs. The SMCs operate completely asynchro- 
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nously with respect to each other and, thus, the TMSDS is under the 
control of as many as eight independent entities. To prevent collisions 
of control and/or “lock-up” situations, the TMspDs has integral control 
circuitry which adjudicates conflicts of control among the sMcs, such 
as may occur during circuit failures, or during the duplication of voice 
messages. 

A request for access to a VAC by one SMc, while a second SMC is 
using the VAC causes the TMsDs to put the second request into a “wait” 
state. The second SMC may wait until an internal time-out occurs at 
which time it either retries the attempt or determines that a fault has 
occurred in the TMSDS. 

The SCAM is a two-stage space division electronic switch engineered 
for 0.001 probability of blocking under peak load. The SCAM 1s con- 
trolled by the microprocessor-based peripheral controller in a manner 
analogous to network control in an Ess. All paths in the SCAM are 
automatically verified after set-up but before the signal path is com- 
pleted by a signal loop-around test conducted between the vAc and a 
“hair pin” in the selected service circuit port. 


2.5 The peripheral controller 


A microprocessor-based pc has been included in the 1A vss archi- 
tecture for the purpose of relieving the central processor of the respon- 
sibility for scanning. This peripheral controller also is responsible for 
the SCAM as previously noted. As described earlier, the PC communi- 
cates directly with VACs via a bit-serial communication protocol! utiliz- 
ing a 27-bit word transmitted to and/or from each VAC or SCAM. This 
word includes commands to the VAc (or SCAM) and received status 
from the vAc (or SCAM). Additionally, each command sent by the Pc to 
the vac is acknowledged by the VAc in a response transmission. This 
provides a checking function to ensure reliability of transmission. 

A loop-around transmission is conducted every 40 ms by the Pc with 
each VAC. This serves a basic scanning function associated with trunk 
status and DTMF signals or dial pulse digit reception. (The DTMF 
receiver of the VACs queues digits and E-lead signals to permit this 
relatively slow scanning rate.) Additionally, the Pc may be directed by 
the central processor to perform special communication with VACs at 
any time in order to effect changes in the state of the vac (e.g., wink 
to the ESS, disable the acc circuit, freeze the recording process). 
Certain “histories” of events are recorded by the pc for periodic 
transmission to the central processor. This is exemplified by a history 
of voice-presence activity seen by each VAC to simplify “time-out-on- 
no-voice” processing within the application software. Associated with 
this function, the pc samples the voice-presence detector of an active 
vAc once each 40 ms and builds a file which documents voice-presence 


852 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


activity over intervals of hundreds of milliseconds. The central pro- 
cessor periodically interrogates the Pc to access voice-presence history 
files and, thus, is not forced to scan the Pc at a high rate to implement 
a time-out-on-no-voice function. 

The Pc is duplicated for reliability purposes with one unit active and 
the other unit in a standby state. The Pc incorporates a master clock 
which controls the population of coDEcs. The master clocks of each Pc 
are interconnected in a special arrangement which causes high-speed 
autonomous switch-over from the clock of the active PCs to the clock 
of the stand-by Pcs upon the failure of the active clock of the pcs. This 
switch-over precedes the follow-on switch of Pcs themselves, which is 
initiated by the central processor. Thus, a master clock failure does 
not cause even a momentary system outage because of loss of the 
CODEC clock. A Pc failure is detected by self-check circuitry in the Pc 
and reported to the central processor. The central processor, thus, 
directs the standby Pc to become active. The newly active pc then 
determines the state of the periphery and assumes control. 


2.6 Central processor 


The central processor used in the 1A vss is the Auxiliary 3A 
Processor (AP). Various versions of this processor are also used in No. 
2B Ess, No. 3 ESS, Transaction Network System,’ and No. 5 Crossbar 
Electronic Translator System. The ap has the general attributes of a 
minicomputer with the added reliability of duplicated processors, 
memory, I/O controllers, and communication buses. Self-checking cir- 
cult techniques enable the ap to have rapid fault detection and 
reconfiguration. 

Three types of frame, the ap frame, the supplementary main store 
frame, and the maintenance frame comprise the central processor for 
1A vss. The ap frame consists of duplicated 3A central controls, 
duplicated semiconductor memories and I/o controllers. The supple- 
mentary main store frame consists of additional semiconductor mem- 
ory. The 1A vss uses 768K 18-bit words of memory with the capacity 
to grow to 1024K words of memory. The maintenance frame consists 
of a system status panel, two cartridge tape units used for loading 
programs, and teletypewriter equipment. 


2.7 Moving-head disk transports 


The storage medium employed by the 1A vss consists of an ensemble 
of moving-head disk transports, each with a storage capacity of 300 
million bytes of digital memory. This translates into a total voice 
storage capacity per disk of approximately 21 hours. 

The disk media itself consists of 11 platters mounted on a single 
rotating shaft. Nineteen of the surfaces of these platters are useful for 
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data storage and each is served by a read/write transducer. The many 
transducers, or heads, are mounted on arms which extend into the 
assembly of platters, and the arms are fastened to a common moving- 
carriage assembly which positions the heads radially on the surface of 
the platters. The access time to a particular datum is a function of the 
current position of the carriage with respect to the physical location of 
the datum on the platter. It is to minimize track access response time 
that a minimal head travel algorithm was selected to dominate control 
over job sequencing within the sMc. 

The reliability of the 1A vss is influenced by the reliability of the 
alternating current power which serves the disk transport community. 
To provide virtually disturbance-free power in the face of the uncer- 
tainties of commercial power service, a new uninterruptible power 
supply (UPS) has been developed for 1A vss. This system, called the 
TRIPORT,’ after its internal structure, will react to power failure 
within a cycle of the disturbance and smoothly convert its own battery- 
supplied inverter for backup without creating significant power wave- 
form discontinuities. 


ill. PHYSICAL DESIGN 
3.1 1A VSS physical design 


The 1A vss equipment uses a standard set of devices, apparatus, 
and design tools known as the 1A technology. The 1A technology 
hardware is used in the No. 4 Ess° and the 1A Processor,’ as well as 
various other ESSs now being manufactured by Western Electric. The 
use of this technology allowed 1A vss to take advantage of the present 
manufacturing capabilities of Western Electric. The ap frame, the 
supplementary main store frame, and the maintenance frame were 
under manufacture by Western Electric prior to the development of 
1A vss. 


3.2 Circuit packs 


Circuit packs used in 1A vss are FB-, FC-, and FE-coded packs. The 
FB- and FC-type packs are approximately 4 by 7 inches, with 40 
pinouts and 80 pinouts, respectively. The FE-type packs are approxi- 
mately 8 by 13 inches, with either 80 or 160 pinouts. These packs use 
the standard WE 946/947-type of connector. 

The majority of the circuit packs designed for 1A vss are six-layer 
multilayer boards with path widths of 8 mils. Where the wiring density 
of multilayer boards was not required, double-sided and single-sided 
boards were designed. 

Great attention was paid to maximize the density of circuitry per 
board. For example, the 1A vss vac is implemented as two circuit 
packs—the trunk access circuit (TAC) pack and the buffer pack. The 
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functions located on the Tac pack include the trunk circuit (Type II 
E&M, 2- or 4-wire), a DTMF receiver, an AGC circuit, a voice-presence 
detector circuit, a CODEC, and interface and local control circuitry. To 
achieve this level of density, a new DTMF receiver based on cco filter 
technology has been designed for the 1A vss. Additionally, most digital 
logic has been implemented as custom-integrated circuits using the 
new low-power Schotcky gate array technology. Figure 5 is a photo- 
graph of the 1A Tac pack. 

The buffer circuit pack contains 327,680 bits of random-access 
memory (RAM) organized as a two-port FIFO memory. Each port may 
operate asynchronously with respect to the other at speeds defined by 
the connecting circuits. The 16K RAM used on the buffer pack is the 
Western Electric coded 28A device. Again, to achieve high-circuitry 
densities, a large portion of the control logic on the buffer pack is 
implemented using the gate array technology. 

Before the multilayer art masters or the integrated circuit masks for 
the gate arrays were produced, machine wire-wrapped models were 
built to test the logic design. Figure 6 shows the buffer pack both in its 
wire-wrap version and in its final printed wire-board version using gate 
arrays. Notice that without the use of the custom-designed gate arrays 
to replace a large number of individual Ic devices, the buffer circuitry 
would have required two packs and connector positions instead of one 
pack. Because a large number of TAc packs and buffer packs are 
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Fig. 5—Trunk access circuit. 
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required in a large 1A vss office, the savings in space and cost were 


considerable. 
All 1A vss circuit packs associated with “growable” functions (for 


example, the Tac and buffer packs) are designed to be plugged into 
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and removed from their units without removing unit power. To achieve 
this, a power switch has been integrated into the ‘circuit pack handle 
to allow power sequencing of individual packs to be effected automat- 
ically. Figure 5, the TAC pack, shows this switch. 


3.3 Unit and frame designs 


The 1A vss has been partitioned into functional units, each com- 
pletely self-contained including power converters and alarm circuitry. 
Figure 7 is a photograph of the SMc unit used to control the moving- 
head disk transports. This unit, as well as all of the basic units of the 
1A vss, was designed using the 1A technology. The mounting plate, 
apparatus mountings, circuit pack connectors, backplane boards and 
backplane wiring, and designation strips are all standard apparatus 
used in other Western Electric manufactured units. 

The units are mounted on standard 1A-type equipment frames 
which are 7 ft high by 2 ft 2 in. wide. All frames are 18 in. in depth. 
Table I lists all of the frames required for a 1A vss office. The vac 
frame is the only frame added to operational systems to serve growth 
needs. Each vAc frame has a capacity of 32-voice access circuits to the 
No. 1/1A Ess offices. The minimum number of vAc frames is two to 
ensure reliability of trunk groups. The vac frames do not have to be 
fully equipped. The maximum number of vAc frames is sixteen which 
provides a total system capacity of 512 Tacs. Figure 8 is a photograph 
of the vac frame. 

Other self-contained equipment provided on an engineered basis are 
the moving-head disk transports (minimum 3) and the TRIPORT 





Fig. 7—Storage media controller. 
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Table I—1A VSS frames 





Number 
Required 
Size Per Office 
EXISTING FRAMES 
Auxiliary 3A processor frame 4’4” 1 
(double bay) 
Supplementary main store frame 22" 2 
Maintenance Frame 22" 1 
FRAMES DESIGNED FOR 1A VSS 
Voice access circuit frame ae 2-16 
Storage media controller frame | 4’4” 1 
(double bay) 
Peripheral control frame 4’4” 1 
(double bay) 
Service circuit frame 22° I 
Test frame 22° 1 
Miscellaneous frame 22” 1 
Power control and distribution frame Pip ee 1 
OTHER EQUIPMENT 
TRIPORT cabinet (located in office 22" 3-8* 
power plant area) 
Disk transport 23” x 36” x 40” 3-24* 


(maximum dimension) 


* Based on CCS II engineering requirements. 


cabinets (minimum 3) which provide the uninterruptible ac power for 
the disk transports. The TRIPORTS are usually located with the 
office power plant. 

Almost all cabling between 1A vss units and frames is connectorized. 
The objective of connectorization is to enable the 1A vss to be fully 
assembled, wired, and tested at the factory using the same cables that 
will be used at site. The only cables not connectorized are the scan 
and distribute leads associated with alarm circuitry. 


3.4 Floor plan 


The 1A vss requires approximately two building bays (approxi- 
mately 800 square feet) for the maximum-size office. A fixed floor plan 
is specified in order to use predesigned cabling. This not only ensures 
cable lengths to meet electrical requirements but also eliminates the 
line engineering of each cable. Figure 9 shows the floor plan layout of 
the 1A vss. As mentioned earlier, the only equipment provided on a 
capacity basis are the vac frames, the disk transports, and the TRI- 
PORTS. The office power plant and TRIPORT frames are usually 
located in a separate power room. Figure 10 is a photograph of a 1A 
vss model. 

The wiring aisles of the frame line ups are 3 ft wide as compared to 
a typical 2-ft width in most electronic switching systems. This allows 
normal office cooling methods to be used, even though a fully equipped 
vAc frame dissipates approximately 1200 watts. 
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Fig. 8—Voice access circuit frame. 


The commercial disk transports used in 1A vss are designed so that 
cabling enters the cabinet from the bottom. This is consistent with the 
normal application of disk transports in computer rooms which use 
raised floors with all cabling being done under the floor. The 1A vss 
has been designed to be located on either a raised floor or on a regular 
floor. Apparatus is available to support either cabling to the disk 
transports under a raised floor or in overhead cable raceways. 


iv. CONCLUSION 


A new centralized system has been developed for the purpose of 
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Fig. 9—Voice storage system floor plan. 


adding high-capacity voice storage and voice storage processing capa- 
bilities to the spc network. This system has been developed using a 
mixture of new periphery design and use of an existing processor 
complex. The system has been implemented in a generalized fashion 
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Fig. 10—Model of 1A vss. 


which allows the addition of future storage services with minimal (if 
any) impact upon the system hardware architecture. 
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1A Voice Storage System: 


Software 


By G. W. GATES, R. F. KRANZMANN, and L. D. WHITEHEAD 
(Manuscript received July 31, 1979) 


The new Custom Calling Services IT (ccsII) have been provided by 
adding a 1A Voice Storage System (1A vss) as a new node in the 
Stored Program Control network. Software and a new trunk circuit 
are required in the No. 1/1A Ess to provide call control, call filtering, 
and routing toa 1A vss. The 1A vss accepts the call and provides the 
package of voice services known as ccs II. The software required to 
provide these services is described. 


I. OVERVIEW 
1.1 Design considerations 


The software required to implement the new line of Custom Calling 
Services (ccs II) being developed for the Bell System exists in both 
No. 1/1A Ess and in the 1A Voice Storage System (1A vss). The 
software in No. 1/1A Ess is required for call screening, for determining 
which calls should receive service by 1A vss and for dealing with the 
interaction of existing services with this new class of services. 

There is a strong interaction between the Ess and 1A VSs in providing 
ccs II services. To the extent possible, all service control has been 
placed in the 1A vss. In addition, all customer data and control data 
are maintained on the disks in 1A vss. In a general sense, as soon as 
Ess determines that the calling party requires a 1A vss-provided 
service, the call is routed to 1A vss. All control data are permanently 
maintained in 1A vss and required data are sent to ESS when it is 
needed. When the data are no longer needed, as when a call answering 
customer deactivates, ESS destroys the data so as to regain the 
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memory space; the data are retransmitted when the customer next 
activates.’ 


1.2 The SPC network 


The partition of function between the Ess and the 1A vss subsystem 
is an example of the growing trend in the Bell System to specialize 
functions in the various nodes of the Stored Program Control (SPc) 
network. The spc network is the name applied to the collection of 
stored program controlled systems which provide customer services 
and Bell operating companies (BOCs) administrative services. These 
systems are interconnected by trunks and data links and, hence, are 
referred to as a network. The network includes the increasing number 
of Electronic Switching Systems (Esss), the Operations Support Sys- 
tems (osss), the Traffic Service Position Systems (TSPSs), and 1A vsss. 
This growing network of stored program controlled systems permits 
increased sophistication in customer features and in techniques for 
providing these features. The 1A VSS services are an example of the 
concentration of customer feature implementation in a specialized type 
of node in the spc network. Such a node can provide its specialized 
features to many class 5 offices by having calls routed to it for service. 

Figure 1 illustrates the role of 1A vss as a node in the spc network. 
The connection to class 5 No. 1/1A Esss and to osss is shown. 


li. SOFTWARE IN NO. 1 ESS 
2.1 Partition of functions between the host office and 1A VSS 


An early objective in the design of 1A vss features was to minimize 
the impact on the interconnecting (host) Ess and to place the major 
burden of responsibility on the 1A vss itself. There were several 
reasons for this important principle: 

(t) The 1A vss processor and system software were new and, hence, 
provided flexible vehicles which could more readily support functional 
changes as the system matured. 

(ii) Although the 1A vss was initially to serve the No. 1/1A Ess host 
office, other host systems such as No. 2B Ess, No. 3 Ess, No. 5 Ess, 
and No. 5 Cross-bar Electronic Translater System were also considered 
as potential candidates for the future. Any function performed within 
the 1A vss would need to be developed only once, whereas each host 
office function would require separate development on each system. 

Figure 2 depicts the major functions performed by the host Ess and 
the 1A vss. The 1A vss itself provides the capability for recording, 
storing, and returning voice messages and announcements, and for 
interacting directly with the customer to provide the services described 
in the companion article.’ For the customer to make use of these 
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Fig. 1—1A vss: a new node in the spc network. 
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Fig. 2—Ess/1A vss software functions. 


services, the host Ess provides several capabilities, some new and some 
extensions of existing capabilities. Essentially, these capabilities are of 
two types: those that are general and those that are related to specific 
features. General capabilities are as follows: 

(zt) Customers gain access to 1A vss by dialing the special service 
prefix (* or 11) plus two digits, or by dialing seven digits. 

(tz) ESS and 1A VSS processors communicate using an expanded 
form of multifrequency signaling. 

(tit) Service orders are entered from the Ess with subsequent trans- 
mission of service order data to 1A vss. 

(wv) Audits of new and modified data are performed to assure the 
integrity of transient data. 

(v) Maintenance of the Ess trunk circuit and the transmission 
facility to the 1A vss is provided. 

(vi) Resource usage counts are recorded for the software resources 
used. 
Capabilities that are related to specific features include: 

(t) Terminating calls are intercepted and rerouted to the 1A vss. 
Intercept is of three types: immediate, busy, and don’t-answer. Call 
Answering service typifies the use of the intercept capability within 
the Ess. 
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(tt) The customer is provided an indication that the 1A vss has 
voice messages waiting to be retrieved by the customer. 

(tiu) Voice messages are delivered from the 1A vss office through 
the host Ess for the Advance Calling feature. Also included is the 
handling of a special AMA billing message sent from 1A vss for billing 
the terminating portion of the Advance Call. 

(tv) Capability is provided for a customer to monitor a call being 
recorded and to answer the call personally if desired. This feature is 
called Monitor/Cut-Through. 

(vu) Coordinated interaction of the new ccs II features with existing 
features, including those of ccs I, is provided. 
The following section presents an overview of the software required 
for the host Ess in order to implement these capabilities. 


2.2 ESS software design overview 


The host Ess for the initial implementation of ccs II is the 1/1A 
Ess. It is beyond the scope of this paper to discuss the detailed 
structure of the 1/1A Ess implementation for 1A VSs services since it 
would require substantial background in the design of 1/1A Ess soft- 
ware and hardware. Hence, the design will be presented conceptually, 
allowing the reader to mentally apply it to any familiar switching 
system, as appropriate. 

The Ess software for implementing the new Custom Calling Services 
can be viewed as a set of new and modified capabilities, each providing 
a particular part of the service. Figure 3 illustrates those capabilities 
and the control flow among them. The following conventions apply in 
Fig. 3: 

© Circles represent input/output devices as follows: 

_ SO TTY—Service Order (so) Teletypewriter. 
LINES—Both 1A vss customer lines and others. 
TRUNKS—Both interoffice trunks to the 1A vss and to other 
switching offices. 

© Rectangles represent the various functional capabilities. 

The primary function of each 1A vss-related capability is discussed 
below. 

The Service Order Handler accepts messages from the SO TTY, 
screens messages to ensure that they are syntactically correct and that 
the subscriber specified by the message is permitted access to the 
services specified. It also assembles appropriate SO messages to send 
to the 1A vss via the Data Message Sender. All 1A vss subscriber 
service orders pass through a host Ess prior to transmittal of the 
service order data to the 1A vss processor. This is done primarily to 
consolidate the administrative aspects of service order processing and 
to allow the Ess to make appropriate screening checks. Thus, a service 
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Fig. 3—Host Ess conceptual design. 


order describing a new customer for the ESs can simply identify the 
1A vss services that the customer wishes to purchase as part of a 
single order. Note that the entire customer profile describing all aspects 
of the 1A vss service purchased is maintained in the 1A Vss processor 
and that those items required by the Ess processor are sent to the ESS 
from 1A vss when the subscriber activates the service. 

The Service Order Handler also 1s responsible for processing 1A vss 
customer-related VERIFY messages which allow the Ess craft person 
to check the content of the customer profile in the 1A vss processor. 
This requires the transmittal of a VERIFY request from Ess to 1A vss 
and a VERIFY RESPONSE message in the opposite direction. 

The Incoming Message Dispatcher receives all MF message packets 
from 1A vss trunks, analyzes the dispatch code, and distributes the 
data to the appropriate client program. 

The Voice Delivery Controller provides functions required for the 
delivery of Advance Calling messages. This essentially resolves into a 
terminating call (if the destination telephone is in the host Ess) or into 
a tandem call (when the destination is not in the host Ess), with the 
1A vss incoming trunk serving as the originating terminal in both 
cases. 

The Signal Dispatcher is a collection of routines which interprets 
signals from lines and trunks to decide which other processes should 
be requested to further process the signals. For example, when a 
customer dials the Call Answering access code *51, the Signal Dis- 
patcher interprets the *51 as a request to activate Call Answering 
service and passes control to the Customer Access Controller. 

The Auditor represents the collection of audit programs which 
assesses the consistency of data for 1A vss-related features. Audits in 
ESS systems form a powerful force to maintain the stability of the host 
Ess. Several new data structures for 1A vss services have required 
corresponding new audit software, while extensions of existing data 
structures required modification of extant audits. 

Note that only the Database Manager is allowed write access to the 
database representing the pertinent customer profile data within the 
ESS for active subscribers, whereas many other capabilities are given 
read-only access. The database for an active Call Answering subscriber 
includes such items as message-waiting tone and message-waiting ring 
indicators, Monitor/Cut-Through feature allowed, and identity of the 
trunk group to the 1A vss. Customer service requests that require 
access to the 1A vss are handled by the Customer Access Controller. 
It screens the request to assure that the request is allowed within the 
ESS, formats an appropriate MF packet, selects a trunk to the 1A vss 
and passes control to the Call Message Sender. Since some ccs II 
services are offered on a usage-sensitive basis within the host Ess, 
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potentially all lines within the office can request activation of these 
services. However, some combinations of usage-sensitive services, such 
as Call Answering on COIN lines, are inconsistent or confusing. There- 
fore screening, based on the originating and terminating major class of 
the line, is used to control such situations so that specific services may 
be denied to particular line classes. 

Whenever a ccs II subscriber has an active intercept feature, such 
as Call Answering the Intercept Controller assumes control of any call 
that would normally terminate to the subscriber’s lines. Its function is 
to perform the actions necessary to route the call to the 1A vss. 

The Monitor/Cut-Through Controller processes requests for this 
subfeature of Call Answering service and is illustrated in the sequence 
shown in Fig. 4. The MONITOR function is accessed when the Call 
Answering subscriber dials the appropriate two-digit access code. A 
check of the subscriber’s database is first made to determine whether 
the Monitor/Cut-Through feature is allowed. 

The Status Indication Controller provides for Message Waiting 
Tone (mwt) and Message Waiting Ring (mMwrR) to alert a Call Answer- 
ing subscriber that messages are waiting to be retrieved from the 1A 
vss. Both status indications are under control of the customer’s data- 
base profile established by the service prototype (see section on the 
1A vss Feature Processor Subsystem). The MwT is provided when 
applicable on all call originations using software control of dial tone 
through standard digit receivers. The MwrR is a short burst of ringing 
applied to the customer’s line following disconnect from stable network 
connections involving that line. 

The Call Message Sender and Data Message Sender perform the 
task of transmitting MF packets of information to 1A vss for various 
software clients as described in the previous section. These modules 
perform functions, such as seizing appropriate memory resources, a 1A 
vss trunk, and an MF transmitter, as well as implementing the inter- 
processor communication protocols. 

The Isolation Talking Monitor disables the Call Waiting feature and 
any similar features that may apply tones or switching noise to a 
customer’s line. This capability is invoked principally, while the cus- 
tomer is in the process of recording a greeting or message on the 1A 
vss, but it may be utilized in other circumstances to prevent adverse 
interaction of ccs II with other services to which the customer may 
have access. 

The 1A vss Trunk Maintenance Controller is responsible for provid- 
ing new diagnostic software to verify the operation of the new two- 
port trunk circuit, as well as the standard end-to-end operational test 
provided on many interoffice trunks. Additionally, provision to allow 
the standard transmission quality tests both automatically and man- 
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Fig. 4—Monitor/Cut-Through capability. 


ually from the various test panel configurations is contained within 
the Trunk Maintenance Controller conceptual model. 
2.3 Interaction with existing customer services 


Great care was taken to assure that the new ccs II services mesh 
well with the many customer services already provided by the 1/1A 
ESS. Two examples are given here to illustrate this interaction. 


Example 1—Call Answering interaction with Call Forwarding. 


Both of these services allow the customer to accomplish a similar 
goal, namely, when activated, each will result in calls that would 
normally terminate at the subscriber’s line being routed to an alternate 
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destination. For Call Forwarding, the customer specifies the destina- 
tion; for Call Answering, the new destination is the 1A vss. Obviously, 
both cannot be active at the same time or the intent would be 
ambiguous. Hence, only one type of intercept service is allowed to be 
active at any one time. If one service is active, subsequent attempts to 
activate the other result in reorder tone being applied to the customer 
line. 


Example 2—Call Answering (busy line) interaction with Call Waiting. 


In this case, a priority of action is used. Call Waiting is useful in 
informing a customer whose line is busy that another caller is trying 
to reach the customer. This is done by applying a short beep-tone to 
the customer’s line at intervals of 10 seconds. The customer, wishing 
to answer the new incoming call, verbally informs the original party of 
his intent to do so. He then flashes the switchhook which results in 
the customer being connected to the new party, while the original 
party is placed on “hold.” A subsequent switchhook flash will bring 
back the original connection. However, if the customer also has Call 
Answering service active and elects not to answer the new caller, the 
latter will be intercepted after the first 10-second period and will be 
routed to the 1A vss. In this way, the Call Waiting and Call Answering 
services conflict minimally and provide enhanced call control capabil- 
ity for the 1/1A Ess customer. 

Feature interactions of this sort are implemented wholly within the 
host ESS. 


2.4 The Interface between VSS and ESS 


As shown previously, 1/1A Ess customers gain access to the 1A vss 
via two-way voice frequency trunks interconnecting the two systems. 
Signaling associated with the use of these trunks is accomplished via 
an expanded form of E and M, Multifrequency (MF), wink-start sig- 
naling, which is still the most common interoffice signaling arrange- 
ment in the Bell System. This communication arrangement was se- 
lected since it was most easily adaptable to other existing switching 
systems. 

Communication messages can be divided into two main categories: 
(t) those messages which will normally proceed to a talking connection 
between a customer and the 1A vss—Call Messages, and (iz) messages 
which involve transmission of call control data only—Data Messages. 
Signaling protocol for Call Messages is quite standard, except for the 
content and amount of information to be transmitted. Typical infor- 
mation in an MF Call Message packet specifies the type of call message 
(e.g., Call Answering activation, or Call Answering intercept), the 
restriction class, the 1A vss customer identity (by Directory Number), 


872 THE BELL SYSTEM TECHNICAL JOURNAL, MAY-JUNE 1982 


and the billing specifications. Standard Call Message protocol is illus- 
trated in Figure 5, Section A. 

Consider for example the Call Message MF packet requesting acti- 
vation of Call Answering service for a casual user. The MF packet sent 
from the host Ess to 1A vss would be of the form 


MESSAGE RESTRICTION BILLING 
(KP) DISPATCH CUSTOMER DN CLASS DN (ST), 
CODE 
where 
KP =KEYPULSE DIGIT 


Message Dispatch Code =2-digit code identifying the receiving 
client program in 1A vss. 

=For example, “complaint observing re- 
quested by this customer.” 


Restriction Class 
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Fig. 5—Ess/1A vss interprocessor communication. 
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Customer DN =4-, 5-, or 7-digit form of the customer 
directory number (DN). 

Billing DN =Directory number to which 1A vss 
should bill the charges for this use of 
Call Answering service (optional). 

ST =“START PROCESSING” digit (end-of- 
message). 


There are 99 two-digit message dispatch codes. Data Message MF 
packets have much the same format as Call Message packets. The 
dispatch code distinguishes the packet as a Data Message and identifies 
the client program in the receiving processor. Examples of Data 
Messages are (i) the activation message sent from 1A vss to ESS in 
response to an ESS customer’s activation request, and (iz) service order 
messages and their replies. 

Signaling protocol for Data Messages is necessarily more complex 
than the Call Message protocol since it involves interprocessor com- 
munication without the presence of the customer to detect the success 
or failure of the communication. Figure 5, Sections B, C, and D depict 
the signaling protocol for Data Messages. Note that one of three 
responses is expected from the receiving processor: 

WINK—Implies successful transmission of the message and accept- 
ance by the client program. 

ANSWER—means the MF packet was received but was rejected by 
the client program, for example, because of incorrect format. 

TIME-OUT—implies unsuccessful transmission in the same sense 
as standard interoffice MF signaling. 

To increase trunk usage efficiency for data transmission, capability 
to batch Data Messages is provided. Batching means that multiple 
independent Data Messages may be transmitted over a single trunk. 
After receiving the WINK acknowledgment, the transmitting processor 
will either disconnect, signifying end of transmission, or commence 
sending the next Data Message. 


Ill. SOFTWARE IN THE VOICE STORAGE SYSTEM 
3.1 Software techniques 


Control in 1A vss is distributed among the central processor and 
several microprocessors which control the periphery. The microproc- 
essors provide the necessary low-level repetitive control and relieve 
the central processor of this workload. 

A major goal in the design and implementation of the central 
processor software was that it be easily modifiable. Several techniques 
and rules were used to achieve software that allows new features to be 
added easily. 
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(t) The fundamental technique is embodied in the software archi- 
tecture. The software architecture was designed to partition the com- 
plexity of the system so that designers and programmers have to 
concern themselves with only a subset of the total problem. 

(a) Software that requires knowledge about the detailed hard- 
ware characteristics is concentrated into a few subsystems and 
the need for this knowledge is eliminated from other subsys- 
tems. This technique makes feature development easier by 
controlling the need for detailed hardware knowledge and makes 
it easier to change the hardware and firmware. It also results in 
less overall software impact when hardware and firmware are 
changed. 

(b) The software that controls feature operation 1s concentrated 
in one subsystem. Other subsystems provide high-level service 
functions to the feature subsystem. This effectively creates a 
language of functions which can be used to build and expand 
services. 

(tz) Within the feature subsystem each feature is implemented as 
independent software. To do otherwise would mean that each time one 
feature was changed, other features could be affected. 

(zzz) Separate data structures are built for each feature. Shared data 
structures for one customer seems natural, but if the features are 
completely disjoint, then the data are kept disjoint to avoid interaction 
effects. 

(tv) A high-level language with data structure definition capability 
is used. | 

(vu) Many characteristics of 1A vss operation are implemented as 
system parameters so that they will be easy to change as experience is 
gained from early customer use. Some of these parameters will require 
software recompilation to change, while others can be changed by 
modification to the system in the field. Examples of system parameters 
include: (a) number of seconds of silence before time out during 
recording, and (b) the time to return answer supervision during a call. 
The concept of parameters is oriented toward overall system charac- 
teristics. An analogous concept of options on specific customer features 
is used and is discussed 1n the section on feature implementation. 

(vi) The software was built with the rule, “Design it correctly, build 
it, then tune it.” Tuning a system too early can destroy its structure 
and, hence, destroy its modifiability. 

(vit) The call-related portions of the software were designed and 
implemented using finite-state-system concepts. 

The finite-state-system design technique consists of partitioning the 
software into functional models where each model is viewed as a finite 
state automaton. The model consists of states, signals, and transition 
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routines. The occurrence of an event causes a signal to be sent to a 
model which is in a particular state. The model executes particular | 
transition routines as a function of its state and the received signal. It 
then enters another state to wait for another signal. Figure 6 illustrates 
the diagram of such a model. 

Finite-state design techniques provide a good structure for the 
implementation of call processing. They produce software which is 
self-documented by the state diagrams of each model. Because of this 
documentation and the intuitive naturalness of the structure, the 
resulting software is easy to read and understand. 

These techniques have produced call processing software which 
should be easy to modify as new features are added to the 1A vss. 
Initial indications are that this goal has been met, but several years of 
experience will be required before a final judgment can be made. 


3.2 Software architecture overview 


The 1A vss software runs under control of the Extended Operating 
System (EOS), a real-time control system developed for Auxiliary 3A 
Processor applications. Call handling software runs in a single Eos task 
and is controlled within the task by the State Table Controller (sTc). 
The stc provides the structure necessary to process signals and to 
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control models as required for the vss finite-state design techniques. 
The stc schedules models, queues signals and maintains state control 
for each model. 

The overall structure of the call processing software is shown in Fig. 
7. The software is divided into six major subsystems. The Feature 
Processor controls the actual customer features. It calls on the Re-. 
cording Trunk and the Database Manager for services; these systems 
in turn request lower-level services from the Input/Output Processor, 
the Voice Manager, and the File System. Support services are provided 
by device handlers, disk memory allocation software, a message dupli- 
cation service, and disk erasure software. 


3.3 The feature processor subsystem 


The characteristics of a customer feature are incorporated in the 
Feature Processor subsystem. Like all call processing software, the 
Feature Processor is a collection of finite state models which performs 
transitions from state to state as the various call events occur. Events 
such as “off-hook,” “customer dialed a 6,” or “message playback 
complete” cause signals to be sent to the appropriate model. The 
model executes transition subroutines, sends signals if required, and 
enters another state to await the next signal. Each call in the system 
creates an “instance” of each model as the call progresses (similar to 
a software process). Multiple call capability comes implicitly from the 
collection of all instances of these models. 

The set of Feature Processing models and associated transition 
routines orchestrates the handling of calls but does little of the actual 
work. The work is done by calling on the Recording Trunk and the 
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Fig. 7—Basic structure of call processing software. 
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Database Manager. These two subsystems provide an extensive set of 
high-level functions which constitutes a primitive language for building 
customer services. 

Examples of the type of functions provided include: 

(t) Report origination 
(tit) Report on-hook 
(zz) Return answer 
(tv) Get dialed digits 
(v) Disconnect 
(vz) Seize trunk 
(vit) Send data 
(vit) Say a system announcement 
(tx) Play a customer message 
(x) Compose an announcement from fragments 
(xt) Record a message 
(xit) Erase a message 
(xuit) Secure customer data 
(xiv) Release customer data 
(xv) Modify customer data. 
Many options have been incorporated into each service in order to be 
responsive to the changing needs of the telephone customer. 

The solution to handling the changing needs of the Call Answering 
customer was to implement essentially all the options which were 
considered useful and to provide a way to define a customer feature as 
a collection of these options. Additionally, several packages of Call 
Answering services can be marketed by defining several collections of 
options. A set of options is called a prototype, thus, a package of Call 
Answering options is defined by defining a prototype. The three 
packages of Call Answering to be marketed initially, 1.e., Daily, 
Monthly, and Deluxe, are created by defining three prototypes with 
the associated, required sets of options. Further flexibility was gained 
by providing the capability, through customer service orders, to modify 
each of the options for the individual customer. Thus, a deluxe cus- 
tomer can have the maximum length of a message extended from 30 
to 60 seconds by a service order indicating such a change for that one 
customer. 


IV. THE RECORDING TRUNK SUBSYSTEM 


The Recording Trunk Subsystem is an abstraction of an “idealized 
voice storage trunk.” Such an “idealized” trunk in 1A vss would be 
capable of recording and playing messages and handling timed se- 
quences in an autonomous manner. By abstracting these characteris- 
tics and incorporating them into a software system, feature designers 
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are given a powerful capability for building voice features which do 
not require intimate knowledge of the complex 1A vss architecture. 
The Recording Trunk Subsystem provides the feature programmer 
the ability to: 
(tz) Record a message 
(iz) Play a message 
(uz) Erase a message 
(tv) Return answer supervision 
(v) Control silence timeout 
(vt) Acquire allowable digits 
(viz) Control digit timing 
(viit) Recognize flash signaling 
(ux) Send messages to the Ess office 
(x) Receive messages from the Ess office. 
The Recording Trunk calls upon the Voice Manager (vM) and the 
Input/Output Processor (IOP) in providing functions to the Feature 
Processor. 


V. THE DATABASE MANAGER AND FILE SYSTEM 


Data services are provided to the system by the Database Manager 
and the File System. The 1A vss Database Manager was designed and 
tuned to fit the type of support needed by vss features. Rapid access 
to the customer database is provided by the physical clustering of 
logically adjacent data. Flexible database services are achieved by 
basing the design on the general concepts of the relational model of 
data structures. 

The File System provides for the random access storage and retrieval 
of variable length records. To provide the required reliability, each 
record is duplicated when written. The File System and the Database 
Manager are designed to specifically complement each other so as to 
meet the objective of minimization of data storage and transfer costs. 
The File System also provides storage services directly for administra- 
tive data such as billing and traffic data and the collection of data on 
equipment failures. 


Vi. INPUT/OUTPUT PROCESSOR 


The 10P provides functional control and error detection for the 1A 
vss trunks and service circuits. In this capacity, 1t receives requests for 
service from the Recording Trunk and system maintenance software. 
Control is achieved through interaction with the microprocessor-based 
peripheral controller, with responses from the periphery distributed to 
the requesting subsystem. High-level functional device requests are 
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accepted by the 1oP and transmitted to the periphery as a sequence of 
device commands. 

The 1o0P also receives notification of autonomous events from the 
periphery, e.g., a trunk seizure. These are distributed, as appropriate, 
to the associated Recording Trunk, maintenance or error control 
software. Timing control and error recovery are also provided by the 
IOP. 


Vil. THE VOICE MANAGEMENT SUBSYSTEM 
7.1 Voice manager 


The Voice Manager encompasses all software for control and ma- 
nipulation of stored voice. The operational unit of access is the mes- 
sage. Messages may have a variable length and are comprised of one 
or more fixed-length segments. These segments are the fundamental 
units of storage allocation and deallocation. Each message has a unique 
owner. The owner may be either a customer, designating a message 
entered in conjunction with the customer’s service, or the owner may 
be the system, designating a system announcement identified with a 
particular VSS service. 

The Voice Manager provides the Feature Processor three basic 
capabilities for manipulating messages: the ability to record a message, 
the ability to play a message, and the ability to erase a message when 
no longer needed, thus, releasing the space for other uses. Each service 
is defined as a sequence of these operations with appropriate system 
announcements played to prompt the customer. 

Because it is impractical to store certain system announcements in 
prerecorded form, e.g., “You have seven messages,” the vm provides 
the capability to play such messages from a small set of prerecorded 
fragments. This allows the Feature Processor to specify the phrase 
“You have seven messages” as follows: 


Specification Fragments 
Fragment identifier “you have” 
Decimal number seven 
Fragment identifier “messages.” 


To allow a reasonable range of numbers, individual fragments are 
recorded. The following are examples of such fragments: the numbers 
1 through 19, plus 20, 30, such phrases as a.m., p.m., and the names of 
the days of the week. 


7.2 Storage media controller handler 


The Storage Media Controller (smc) handler provides the commu- 
nication path between the Voice Manager and the microprocessor- 
based peripheral device which performs the voice commands. 
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The structure of the handler is governed by two characteristics of 
the SMc: its ability to service many active calls simultaneously and its 
highly autonomous operation. The handler defines four phases of 
operation, described below, and provides all necessary synchronization. 
These phases are staggered for the equipped SMcs to smooth the main 
processor load. 

Command Phase—The handler transmits all voice commands ac- 
cumulated during the last cycle. 

Voice Phase—The handler is dormant. The smc schedules and 
performs the commands it has been given, signaling the handler when 
complete. 

Reply Phase—The handler reads the status replies indicating the 
disposition of the commands performed during the voice phase. 

Disk Controller Phase—The smc performs the functions of a con- 
ventional disk controller, e.g., data read/write. This state holds until 
the start of the next cycle. 


7.3 Storage allocator 


The Storage Allocator is charged with managing the storage avail- 
able for digitized voice in the SMc community. This storage is addressed 
by specifying the smc, Disk Transport, and segment. The number of 
smMcs and Disk Transports varies with office configuration; segment 
numbers are a property of disk geometry. The basic strategy of 
monitoring the idle/busy status of segments and providing rapid 
allocation/deallocation of resources employs a combination of main- 
memory-resident segment address lists and disk resident maps. The 
memory lists provide for normal, rapid resource allocation/dealloca- 
tion, while the disk maps maintain the idle/busy status of all resources 
and provide a permanent record of disk configuration. 


Vill. SYSTEM MEASUREMENTS 


Since 1A vss is a switching-type entity, it requires traffic and billing 
data and interfaces to both the Engineering and Administrative Data 
Acquisition System (EADAS) and the Automatic Message Accounting 
Recording Center (AMARC) to make these data available to the Boc. 
However, 1A vss provides an entirely new class of vertical services, 
ccs II, without precedent. Hence, little information exists upon which 
to gauge, for example, customer response to the new services or traffic 
engineering rules. 

A single, unified data collection mechanism is provided to meet data 
collection requirements. For each system activity of interest a unique 
action is defined with regard to data content, reasons for collection, 
method of collection, and intended uses. Because of the different uses 
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of the data, two primary collection methods are provided: the peg 
count, typically used for traffic actions which appear on the quarter 
hourly reports, and the transaction file, in which the action, and 
associated parameters, are written into a disk file. 


IX. SECURITY AND RELIABILITY 


A major concern during the design of 1A vss has been the protection 
of customers’ messages. The two aspects of this concern are that a 
message should be returned only to the correct customer and that 
messages should not be lost. To guarantee the correctness of delivery, 
the identity of the customer owning the message is stored in the 
control portion of each message segment. During playback, the identity 
of the requesting customer is passed to the smc and each message 
segment is validated before being played. During recording, each 
message segment is checked before being written to verify that its 
current owner is the Storage Allocator. These checks protect against 
several types of failures that might cause a message or a message 
portion to be played to the wrong customer. 

Messages are protected against loss by replicating each one. System 
announcements and voice fragments are replicated on each SMC and 
are accessed via translators. This is done both for reliability and 
availability. Because of the lower access rates, customer messages are 
duplicated for reliability. To accomplish this, the following deferred 
duplication scheme is used. As the customer speaks the message, one 
copy is recorded in real time. Upon completion of this recording, a 
duplicate command identifying this newly recorded message is placed 
on a queue. A background program serves this queue as processor and 
SMC time permits, and records the second copy of the message. 'T’o 
provide the desired quality of service, a system parameter specifies the 
maximum tolerable elapsed time to duplication. If this value is ex- 
ceeded, duplication takes precedence over other new work, until the 
desired level of service is restored. 


IX. SUMMARY 


ccs II services are provided jointly by software and hardware en- 
hancements in No. 1/1A Ess and by a new voice processing system, 1A 
vss. The software for these services has been provided in such a way 
as to provide economical service and to permit straightforward expan- 
sion to new voice services in the future. The 1A vss software contains 
the basic voice control software functions needed for many types of 
voice services. These building block functions permit expansion of ccs 
II to meet marketing requirements. 
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1A Voice Storage System: 


Office Engineering, Maintenance, and 
Reliability 
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The quality of service that will be delivered by the Voice Storage 
System (1A vss) is influenced by a number of diverse factors which 
are addressed in this paper. High intrinsic reliability is designed into 
the system at every level; it is built from system redundancies, defen- 
sive software strategies, hardware self-checking, and manufacturing 
quality control. Good service also requires a complement of equipment 
which can adequately handle peak traffic loads; the sizing of this 
complement is the job of Office Engineering. Effective maintenance 
software will minimize, and in most cases obviate, the impact on 
service of circuit component failures. And finally, effective plans and 
test equipment are required to ascertain that a newly installed 1A 
vss properly performs its intended functions. 


I. INTRODUCTION 


The 1A Voice Storage System (1A vss) comprises a new functional 
node in the telephone network, which will provide a number of new 
services classified generally as Custom Calling Services II (ccs II). 
These services provide the means for conveying voice messages be- 
tween customers who do not happen to be at their telephones at the 
same time. 

A conventional telephone switching office mediates communication 
between two parties by setting up a connection over which they may 
converse. Unlike a switching office, the 1A vss takes on the role of one 
of the conversing parties. In the course of handling a typical call, the 
1A VSS receives input control signals at various times, outputs system 
announcements in response, and either records or retrieves customer 
messages. 
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The quality of service is important; yet, this type of communication 
is complex; there are many opportunities for system malperformance 
to disrupt the call. The following paragraphs introduce several factors 
that contribute to high quality service; they are expanded upon in the 
remainder of this article. 

Office Engineering, which is covered in Section II, is the process 
through which a Bell operating company (Boc) decides specifically 
what equipment to order to provide ccs II.’ Inherent in the process 
are estimates of the amount of traffic to be expected as a function of 
time. This information is derived from market studies which project 
the expected usage of specific services in specific customer sectors. 

Once a system is put into service, however, it is important to obtain 
feedback which will allow the early usage estimates to be examined in 
light of actual market acceptance. With such information, intelligent 
decisions can be made for office growth, which tracks increasing service 
demands. The collection and utilization of these usage data is covered 
under Traffic Measurements in Section 2.4. 

With 1A vss services in use, the system is being entrusted with large 
numbers of customers’ messages. Extensive precautions have been 
taken in all phases of the system design and manufacturing process to 
keep these messages secure and to assure their delivery. The design 
and manufacturing strategies employed to assure dependable 1A vss 
performance are covered in Section III under the heading of Reliabil- 
ity. 

Although the best quality obtainable with present technology is 
built into the 1A vss components, precautions are taken against the 
occurrence of component failures by providing spare equipment and 
maintenance software which can locate the malfunctioning component. 
This software then responds to system troubles by configuring a 
working system that excludes the faulty unit. The philosophies and 
methodology employed to keep 1A vss operational and to facilitate 
repair are presented in Section IV. 

As a final step in office installations, many system tests are per- 
formed by the Western Electric installation organization. These are 
followed by Boc tests which verify the system’s interfaces with other 
systems. Associated feature tests verify that the 1A vss works properly 
when accessed through the connecting Ess offices. This testing is 
covered in Section V. 


ll. OFFICE ENGINEERING 
2.1 Planning 


The engineering of a 1A vss office and of the associated equipment 
in 1/1A Ess is performed to determine the specific equipment that is 
required to provide ccs II. Initially, trial office engineering is performed 
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for economic feasibility study purposes. Such trial economic studies 
result in a decision as to whether or not to proceed with a local offering 
of ccs II. After a decision is made to proceed with 1A vss, an order is 
placed for the required equipment. 

To engineer a 1A vss office and the equipment in the associated 1/ 
1A Ess offices, ccs II marketing information is necessary. Estimates of 
the rate of penetration and the ultimate market penetration for the 
services being provided are required for three types of stations: resi- 
dential, small business, and large business. Providing this essential 
input is the responsibility of the Boc marketing organization. 

The schedule for introducing ccs II in the 1/1A Ess offices must 
also be determined. For each 1/1A Ess that will provide these services, 
the number of residential, small business, and large business stations 
is required. The estimated annual growth rate for each of these 
categories, and the period of time for which the equipment is being 
engineered, must also be specified. 

The above marketing and 1/1A Ess information comprise the re- 
quired inputs to the worksheets which guide BOc engineering personnel 
through the numerous calculations required to determine the amount 
of equipment required to provide ccs II services and to support their 
growth. 


2.2 Storage and offered load 


Storage time is a new traffic parameter associated with 1A Vss. 
Storage time is defined as the time a voice greeting or message is 
stored in 1A vss before it is erased. The average storage time multiplied 
by the number of greetings and messages recorded during this time 
gives the number of simultaneous greetings and messages in the 
system. This, in turn, multiplied by the average length of a voice 
greeting or message determines how much storage is required, and 
hence the number of disk transports in a 1A vss office. 

The two main parameters that must be determined to engineer a 1A 
vss office are (1) the offered load (erlangs) between each 1/1A Ess and 
1A vss, and (iz) the total storage (disk capacity) required in the 1A 
vss. These are calculated from numerical arrays which contain market 
penetration estimates and traffic characteristics tabulated by service 
and customer types. The two-way trunks between 1/1A Ess and 1A 
vss are engineered for 0.01 probability of blocking. The disk transports 
are engineered for more than an order of magnitude better blocking 
performance than the trunks. 


2.3 Equipment engineering 


For the No. 1/1A Ess offices, ccs II services require new dial pulse 
repeating/monitor trunks, additional program store and additional call 
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store. The increase in program store is required to hold the ccs II 
feature; the additional call store holds the service translations. Since 
additional ESS capacity is required for 1A vss-handled calls and for 
advance calling delivery, some 1/1A ESS processor capacity must be 
allocated for these services. The ccs II services have a minimal effect 
on the 1/1A Ess network and service circuits. 

The 1A vss office equipment’ (see Figs. 1 and 2) can be divided into» 
three categories. The first category is comprised of a minimum basic 
set of common equipment required for all offices. Included in this first 
category are the two Auxiliary 3A Processors (labelled generically as 
the Central Processor in Fig. 1), main memory storage, the two 
peripheral controllers, three storage media controllers (SMCs) and a 
service circuit frame. 

The second category covers major frames or units of equipment 
which are either traffic sensitive themselves or which must be provided 
to support traffic-sensitive equipment. Frames or units in this category 
must be provided by the supplier during the system installation, or at 
major growth periods. Typically, this equipment is engineered for a 
two-year period. Included in this second category are voice access 
circuit frames and units, SMCs, and uninterruptible power equipment 
called triports. The voice access circuit frames in this category com- 
monly contain additional unequipped circuit pack locations to accom- 
modate equipment in the third category. 

The third category is comprised of traffic-sensitive plug-in equip- 
ment, which includes trunk access and buffer circuit modules, plus 
disk transports. These units are procured as required and may be 
installed and turned up for service by the BOC maintenance personnel. 
Such growth results from adding 1/1A Ess connecting offices, penetra- 
tion into the potential user market anticipated in the market forecast, 
or a rise in actual trunk group and storage usage based upon traffic 
measurements. Equipment in the third category is typically engineered 
for a 6-month period. 


2.4 Traffic measurements 


Traffic measurements are provided on periodic reports referred to 
as C, H, Q, and D schedules. The C schedule lists measurements of 
trunk utilization, while the H schedule provides measurements of 
internal subsystem utilization. Measurements which are taken to re- 
flect the load on internal subsystem resources include counts of disk 
transport, service circuit and disk subsystem usage, total calls handled, 
processor real time, processor main memory usage, and a record of 
overload control actions. Both the C and H schedules are half-hour 
reports. The quarter-hourly Q schedule is a subset of the H schedule 
that provides a quick look at a few key measures of system performance 
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Fig. 1—No. 1A vss architecture. 
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Fig. 2—No. 1A vss network partitioning. 


that indicate the proper functioning of the vss office. The D schedule 
gives the daily totals of the C and H schedules. Also, the traffic 
measurements on trunks, storage, and usage are enumerated by service 
on the D schedule for specific half-hour periods. 


2.5 Interface to the total network data system 


The 1A vss has two independent data links which comprise the 
interface to the Total Network Data System (TNDS). One connection 
is to an Engineering and Administrative Data Acquisition System 
(EADAS), and the second is to the Network Administrative Center 
(NAC). 

The EADAS polls vss for the H, C, and D schedules. Within EADAS, 
sets of traffic measurement outputs are triggered when preset thresh- 
old criteria are exceeded to indicate heavy traffic conditions. The 
EADAS also supplies the vss traffic measurements to other TNDS 
systems for additional traffic processing and analysis. 

The NAC channel is an interface to the No. 2 Switching Control 
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Center (SCC) system over a dedicated facility. It is used to transmit the 
Q schedule. In the event of an EADAS data link or machine failure, the 
NAC channel serves as the backup for all traffic data (H, C, and D) in 
addition to the normally received Q schedule. 


2.6 Transaction file 


The transaction file is a mechanism for the collection, storage, and 
retrieval of data generated by the vss application software. Raw data 
concerning the software activity or state at given times during a 
program’s execution are written to the disk file. The transaction file is 
processed to produce a detailed engineering, service cost, and human 
factors characteristics of vss services. Included in this information are 
distributions of holding times and storage utilization. The traffic mea- 
surements give the average values of the traffic parameters, but the 
transaction file analysis gives the complete distributions. 


ill. RELIABILITY 
3.1 Overview 


Although experience with previous ESSs may be drawn upon in 
establishing reliability criteria for 1A vss, there are important differ- 
ences. For 1A vss, the term “call” has a special meaning: In a switching 
machine, the primary job is to establish a path (talking connection) 
over which two parties may converse. It does not matter whether the 
parties are human or computer; in either case, once a connection is 
made, the burden of the information exchange is the responsibility of 
the parties, not the switching machine. The 1A vss, however, handles 
one end of the information-exchange transaction; furthermore, infor- 
mation flows alternately in each direction as is typical of a conversa- 
tion. In the course of handling a typical call, 1A vss receives input 
control signals at various times, outputs system announcements in 
response, and either records or retrieves customer messages. This type 
of communication is complex. The system is continuously involved in 
a dynamic interaction with the customer during the entire course of 
the connection, and there are many opportunities for a malfunction to 
disrupt the call. 

At the first level of analysis, the 1A vss may be viewed as two 
primary subsystems: (i) a large disk storage community, and (ii) an 
access system which handles the storage and retrieval of disk infor- 
mation. These two subsystems are quite different in nature: the disks 
are electromechanical moving-head storage devices, while the access 
system consists of program-controlled digital circuitry. Hence, the 
approaches taken to achieve acceptable system reliability must take 
these characteristics into account. 

The first approach used, as a defense against occasional failure of 
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disk transports, is to create duplicate copies of stored messages on two 
different disks. The second approach is to engineer the remainder of 
the system, via which the disks are accessed, to a failure rate suffi- 
ciently low so that it does not contribute significantly to the probability 
of losing a message. The manner in which this is accomplished is 
discussed later. The net result is that the overall system reliability is 
presently dominated by the failure rates of commercial disk transports. 
The 1A vss is expected to provide a grade of service comparable to 
that of ordinary telephone service. 


3.2 General design approach for reliability 


Reliable operation of 1A vss, as with other large systems, depends 
upon numerous design and manufacturing factors. The high probabil- 
ity of long-term reliable operation is built into the system at every 
level—from system design through hardware, firmware and software 
designs, carrying through quality control in the manufacturing process. 

At the system level, extensive redundancy is used. Key units (e.g., 
the Auxiliary 3A Processor) are fully duplicated, and are provided with 
automatic protection switching so that a failure of either unit will still 
leave the system fully functional. Engineered units (e.g., smc and disk 
units), whose quantity is traffic dependent, have on-line spares. The 
system design also encompasses power backup, communication chan- 
nel redundancy, alternate external interface channels, and trunk group 
diversity arrangements to obviate or minimize the impact of compo- 
nent failures. At the hardware level,” self-checking arrangements are 
extensively used in the design. Cyclic redundancy check (CRc) circuitry 
monitors all message data at several points along the storage and 
retrieval path. Parity circuitry checks data transmissions along com- 
munication buses into and out of main memory and over disk storage 
and retrieval paths. An alarm system of ferrod scan points keeps watch 
over power and other hardware conditions. Firmware in the intelligent 
controllers [smc and Peripheral Controller (Pc)] routinely performs a 
number of checks to assure proper setup of switch and matrix connec- 
tions, as well as validity of the transmitted data. 

Furthermore, the analog path over the trunks which connect 1A vss 
with Ess offices is automatically checked by use of an interoffice 
communication scheme which uses MF signalling over these same 
trunks. 

At the software level, 1A vss incorporates an extensive system of 
maintenance, diagnostic, and audit programs which can detect and 
locate a wide variety of hardware faults and transient errors. The 
maintenance system will perform appropriate system reconfigurations 
if necessary. 

It should be noted that 1A vss reliability, and hence availability, is 
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influenced by the time required to repair certain faults. If one of a 
duplicated pair of units has failed, the system still functions normally. 
However, it is now vulnerable to a total outage which would occur if 
the mate unit should fail before the first one is repaired. The proba- 
bility of this happening is proportional to the repair time. The 1A vss 
design incorporates a number of features which aid craft personnel to 
minimize repair time. These include automatic trouble-locating cir- 
cuits, firmware and software, trouble-locating reference material to 
interpret diagnostic results, modes of manual diagnostic program exe- 
cution, built-in test facilities, and a physical arrangement in which 
circuit boards and 95 percent of the interunit cabling are connectorized. 

The quality of the 1A vss equipment itself is controlled by tests and 
inspections at each assembly level through the manufacturing and 
installation process. Basic components are either manufactured by 
Western Electric or purchased under rigid specification to guarantee 
the incoming quality. Once components are assembled onto circuit 
modules, these are thoroughly tested on computer-driven test facilities. 
When the modules are assembled into functional units and frames, 
these are, in turn, tested at the factory prior to shipment. Since a 
standard floor plan is used by 1A vss, the connectorized cables which 
will be used in the field to interconnect frames are included in the 
factory test and shipped along with the frame. 

Final testing occurs at the 1A vss office site where, as part of the 
installation process, the frames are interconnected and operated as a 
system. 


3.3 Hardware redundancies 


3.3.1 Duplicated units 


On IA vss (see Figs. 1 and 2), there are two unit types which perform 
common-control functions on which system operation depends.” These 
are the Auxiliary 3A Processor (AP) and the Pc. Each is fully duplicated. 

Associated with each aP is a complete set of communication buses 
over which it exchanges data with other units. Communication with 
the SMCS Is via a direct-memory access (DMA) parallel channel. The 
even-numbered SMCs connect via one set of bus hardware, while the 
odd-numbered sMcs connect via the second set. This arrangement is 
replicated for the second AP. The A and B power buses similarly supply 
odd- and even-numbered smcs. Taking this arrangement into account, 
the 1A vss software always places the duplicate of a message on a disk 
subsystem (Smc and its disks) of the opposite group (even or odd). 
Therefore, even if an AP should fail, and then one of the DMA channels 
on the good AP should also fail, the system will be able to retrieve a 
copy of every stored message. Since each AP also has a separate DMA 
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bus connection to each Pc, the system can survive a failure of any PC, 
AP, or interconnecting parallel channel. 

Another characteristic of these redundant bus arrangements is that 
a failed peripheral unit can always be diagnosed from the active AP. 
This means that the standby AP can remain at all times in the update 
mode; in this way, the system is better prepared to handle an AP failure 
as well. 

Each of the pcs, which controls the per-trunk circuitry and service 
circuit access via the Service Circuit Access Matrix (SCAM), has its own 
communication channel and interface circuit in each Voice Access 
Circuit (vAc). Hence, failure of either a Pc or its communication with 
a VAC can be bypassed by switching to the other Pc. 


3.3.2 Engineered units—M plus N sparing 


Several unit types are engineered to a quantity determined by the 
traffic to be handled by the office. This, of course, includes the vAc 
units which contain the per-trunk circuitry for up to 16 trunks, and 
the service circuits, such as MF transmitters and receivers. The disk 
subsystems (SMCs and their disks) are also engineered, although the 
quantities can also be influenced by other factors such as reliability 
considerations or traffic measurements. For example, the number of 
disks provided will depend on the average length of time that messages 
are left in the system. In like manner, the Time Multiplex Space 
Division Switch (tmsps), which interconnects the trunk and the disk 
equipment communities, is designed in a modular fashion for conven- 
ient growth. 

A failure of one of these engineered units could degrade service, but 
it could not take the entire 1A vss out of service; hence, the number 
of spares provided is less than full duplication. The sparing philosophy 
used has come to be known as M plus N sparing where, for M in- 
service units, a number of spares N is provided where N < M. Full 
duplication is where N = M. An additional philosophy applied for vss 
is that the spares are kept in service and in use. In this arrangement, 
no unit can be identified as the spare; however, should a failure occur, 
the number remaining in service can carry the traffic at the engineered 
level. 

There are two main advantages to keeping spares on line: (i) they 
contribute to a better grade of service during the large fraction of the 
time when there are no failures, and (zi) they are periodically tested 
by both routine diagnostics and by the operational checks which go on 
all the time in in-service equipment. 


3.4 Automatic trouble detection and protection switching 


As alluded to earlier, a key element in the graceful failure mecha- 
nisms of 1A vss is the action taken by the maintenance and diagnostic 
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software. These programs react to either hardware or operational 
software indications of trouble by testing suspect units or communi- 
cation paths, isolating the trouble area and reconfiguring the system 
so as to bypass the faulty equipment. The diagnostics are also routinely 
called in by the maintenance software for testing of the system each 
night during hours when the system is relatively idle. The maintenance 
software is described in Section IV. 


3.5 Storage duplication 


Whereas 1A vss disks hold programs and data used by the processor, 
the bulk of the storage media is required for messages and announce- 
ments. In general, when any information is stored, it is then scheduled 
_ for duplication on another disk associated with another smc. This 
arrangement allows outage of a disk or of other system components 
without loss of any data. 

Although processor information duplication is scheduled with a high 
priority, analysis shows this to be unnecessary for voice messages. 
That is, as long as messages are duplicated within approximately one 
hour, the delay contributes little to the probability of lost messages 
since this delay is still very short relative to the mean time between 
failures of disk transports. 

However, the advantage of the delayed-duplication philosophy, is 
that priority may then be given to more urgent service-affecting 
activity, such as the handling of calls during a temporary peak traffic 
period. During much of the time when the system is not heavily 
loaded, messages will be duplicated quickly. 


IV. MAINTENANCE SOFTWARE 
4.1 Overview 


A complex maintenance software system is required to enable the 
1A vss to meet the high reliability objectives described in Section III. 
The primary responsibility of this maintenance system is to accept 
error indicators from the operational and administrative software, 
reconfigure the 1A vss such that the suspected faulty unit is isolated, 
diagnose the isolated unit without affecting normal 1A vss functions, 
and help resolve exactly where the fault is in terms of replaceable 
circuit modules. The main interface with the operational and admin- 
istrative software is the error control subsystem. This subsystem 
receives the error indicators and determines the corrective action. If 
the error control subsystem is unable to maintain a working configu- 
ration, the system recovery subsystem is called into action. Other parts 
that make up the maintenance software system are (z) The trunk 
maintenance subsystem, which includes the ability to perform auto- 
matic tests between the 1A vss and connecting client offices, manual 
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tests from either end, and automatic trunk administration functions. 
(ti) The routine diagnosis subsystem, which is responsible for auto- 
matically testing the entire 1A vss periodically. (ziz) The power/alarm 
subsystem, which monitors the system’s power and alarm indicators, 
controls the system status panel, and operates all the other audible 
and visual indicators. (tv) The maintenance administrative subsystem, 
which accumulates hourly, daily, and monthly maintenance status and 
reports. All of the above functions are controlled and administered by 
the maintenance control subsystem. This control subsystem coordi- 
nates all maintenance activities in the 1A VSs, maintains the vss 
configuration database, initiates all diagnostic executions, and admin- 
istrates system reconfigurations. The remainder of this section will 
describe these subsystems in greater detail. 


4.2 Error control and error history analysis 


The error control subsystem receives error indicators from opera- 
tional and administrative software. It is responsible for evaluating 
these error inputs and determining what action to take. Figure 3 shows 
the sequence used to maintain a working configuration when the 
system experiences errors. Error messages sent to error control can be 
classified into the following types: (z) A device handler has lost the 
ability to communicate with its hardware. (ii) An error occurred but 
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the operation was successful because of a retry strategy. (zzz) An error 
occurred and either no retry strategy was employed or it was unsuc- 
cessful. 

When an error message of the first type is received, the unit involved 
is already out of service since fault recovery code is designed into the 
device handlers to cause an immediate reconfiguration 1f communica- 
tions are lost with the unit. In this case, error control does not have to 
resolve the problem. The action taken is to request a conditional 
restore of the unit identified in the error message. The unit is already 
out of service so this request results in a diagnosis of the unit. If the 
resulting diagnosis fails, appropriate failure information used to resolve 
the fault 1s displayed on the maintenance teletypewriter (TTY) and the 
unit is left out of service. Manual action will then be required to restore 
the unit. If the unit passes diagnosis, it is put back in service and the 
history analysis part of error control is informed. In certain instances, 
it is possible for a unit to fail but pass diagnosis and be restored. To 
prevent an endless cycle of this type of occurrence, each time the unit 
is restored, history analysis is informed. If this cycle repeatedly occurs 
in a short time span, history analysis will order the unit to be removed 
from service and to be put in a trouble state. Manual action will then 
be required to restore the unit to service. 

An error message of the second type is called a transient error. No 
attempt to resolve the problem is made by error control. Instead, the 
transient error is sent to history analysis. If many transient errors for 
a particular unit occur in a short time span, history analysis will cause 
the unit to be diagnosed by requesting a conditional restoral. Since the 
unit is in service at this time, it will be removed from service when the 
restore request is received (diagnostics can be run only on an out-of- 
service unit). Once the unit has been removed from service, the actions 
taken are identical to those in the first error-type sequence, including 
the feedback to history analysis if the diagnostic passes. 

Error messages of the third type trigger the greatest amount of 
activity. Some of these messages are such that a specific unit is 
immediately implicated. For these cases, a conditional restore of the 
unit is requested. Other type-three error messages do not allow error 
control to make an exact determination of where the fault lies. These 
messages result in conditional restorals of some suspect units, and 
implication of other units via transient error messages sent to history 
analysis. In the former case, the resulting diagnostic will either find a 
fault or exonerate the units. The action in the latter case is identical 
to that taken for transient error messages sent by operational software. 

It is expected, in the cases where the determination is not clear, that 
the unit implication lists and history analysis thresholds will be opti- 
mized as field experience becomes available. 
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4.3 System recovery 


The system recovery subsystem has the responsibility for recovering 
a working 1A vss when the system must be reinitialized (similar to 
phasing the Ess machines) because of processor problems, when cat- 
astrophic peripheral hardware failures occur, or when 1A vss software 
Insanity occurs. When the system is reinitializing because of processor 
problems, system recovery has little control over the situation. If 
sufficient time does not elapse between successive reinitializations, the 
reinitialization level escalates. At each level, more drastic software 
initialization occurs until the. bootstrap level is reached. Once this 
occurs, system recovery takes its first positive action—it triggers a 
memory reload of all resident memory programs and the office data- 
base. 

As specified in Section III, some of the 1A vss periphery is dupli- 
cated. If a duplex failure occurs, the failure is catastrophic and 1A vss 
ceases to function operationally. When the duplex device handlers 
recognize this occurrence, they send a message to system recovery and 
stop functioning. System recovery will attempt to find a working 
combination of one of these devices and one of the processors by trying 
all possible combinations and by driving the initialization level higher 
and higher each time. Unlike the escalations caused by processor 
problems, if the system recovery subsystem is unsuccessful at finding 
a working combination, the escalations can be stopped manually by 
pressing the manual stop key on the system status panel. When the 
manual stop key is pressed, the 1A vss will stop thrashing through 
initializations and will settle into a quiescent nonoperational state 
where diagnostics can be run manually on the duplex-failed unit until 
the faulty component is found and repaired. When one of the units is 
functional, the manual stop key is released and a manual bootstrap 
will restart the system. 

The system recovery subsystem provides an outlet for software 
modules (including the system recovery modules) that find themselves 
in untenable states from which further processing would cause further 
system software failures. 

When this occurs, a module can send a message to system recovery 
which causes an initialization of the module, and perhaps all other 
modules depending on the level of initialization. This could cause an 
escalation to the memory reload bootstrap level. The manual stop key 
has no effect on software sanity initializations. 

The last interface to system recovery is utilized when the 1A vss 
configuration has deteriorated below a predefined threshold. For 
threshold analysis, the 1A vss is divided into the storage subsystem, 
the service circuits subsystem, and the voice access (per trunk) circuit 
subsystem. When 50 percent or less of the units in any of these 
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categories are in service, system recovery is notified. The first notifi- 
cation will cause a switch of the processors which will raise the level 
of initialization by one. The next unit removed from service in that 
group will trigger conditional restores of all the out-of-service units in 
the affected subsystem. Further reports of units being removed from 
service in this category will be ignored until all the units in the category 
are out of service. If this occurs, all the units in the category will be 
unconditionally restored to service (unconditionally means that no 
diagnostic is run), and the threshold recovery algorithm is reset. The 
algorithm is also reset if sufficient time elapses between any of the 
levels of recovery. The manual stop key functions as described in the 
duplex failure case. 


4.4 Trunk maintenance 


The 1A vss office connects via trunks to the 1/1A Ess connecting 
offices. The main philosophy of the 1A vss trunk maintenance plan is 
that the connecting offices, regardless of type, are the controlling 
offices for the interconnecting trunks. This means that the 1A vss will 
have supporting automatic trunk test equipment, but will not have 
responsibility for initiating trunk facility maintenance. The connecting 
offices execute and evaluate end-to-end operational and transmission 
tests. Nevertheless, the 1A vss does contain a substantial amount of 
trunk maintenance software which is used for trouble detection, trou- 
ble verification, sectionalization, repair, repair verification, service 
protection, and new circuit installation or circuit rearrangement test- 
ing. 

The 1A vss will accommodate the execution of end-to-end opera- 
tional tests initiated from the connecting office. This test, which is run 
whenever the trunk diagnostic is executed at the connecting office, will 
validate the interoffice signaling capability and continuity over the 
transmission path, but will not test transmission quality. This will 
occur automatically at least on a daily basis. The 1A vss will also 
accommodate a Remote Office Test Line/Centralized Automatic Re- 
porting on Trunks (ROTL/CAROT) automatic transmission test. Only 
terminating test equipment exists at the 1A vss; therefore, the trans- 
mission test must be initiated at the ESs connecting office end either 
manually or automatically through a cARoT facility. This test, which 
includes a verification of transmission quality, will be run automatically 
at least once a day. 

The 1A vss provides extensive trunk service protection on its own. 
Whenever an internal 1A vss problem exists which affects one or more 
vacs, the corresponding trunks at the vss end are put in the reverse- 
make-busy state. A trunk is in this state when it is seized from one 
end, but no signaling is sent or accepted by that end. Viewed from the 
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opposite (ESs) end, the trunk is said to be high and wet. This will keep 
the connecting offices from using the affected trunks at the price of 
making them execute some software which deals with the high-and- 
wet state. When the internal 1A vss problem is cleared, the trunk 
circuit will be put back into a normal idle state. 

All carrier groups connected to a 1A vss have hardware Carrier 
Group Alarms (cGAs). A CGA is reported to a trunk maintenance 
module which immediately removes all associated trunks from service 
and releases any service circuits tied to these trunks. This protection 
is required because a 1A vss has only a small number of service circuits 
which could all be occupied by a faulty carrier that causes all the 
associated trunks to be seized. When the cGA is cleared, all the 
associated trunks will be put back in service automatically. 

Whenever a VAC is put back in service, an operational end-to-end 
test is run on the corresponding trunk from the 1A vss end. This test 
does not require any special software at the Ess end. It is used to 
determine if the trunk is high and wet (seized permanently from the 
Ess end), or if it is otherwise faulty so that it should be locked out at 
the 1A vss end. The locked-out condition occurs if the Ess does not 
respond to a trunk seizure from the vss end. A locked-out trunk will 
not be used operationally by the 1A vss but will be treated normally 
if seized by the connecting office. Trunks in the locked-out state will 
be automatically and periodically tested by the 1A vss end-to-end test. 
If it passes, the trunk will be restored to a normal condition. A trunk 
found to be high and wet will be restored to service automatically 
when the permanent seizure from the connecting end is dropped. 

The trunk maintenance subsystem also provides a manual trunk 
maintenance capability from either end of the trunk. The craft person- 
nel at the connecting end can request several test signals from the 1A 
vss, or can be connected to the trunk test panel at the 1A vss. By 
using various TTY input messages and the trunk test panel, craft 
personnel at 1A vss can call test signal generators or the trunk test 
panel at the connecting office end. 

The key points of the trunk maintenance plan for the 1A vss are 
that the extensive trunk maintenance software, along with the error 
control/history analysis software described earlier, provide a powerful 
sectionalization tool for all trunk-related problems. Consequently, 
almost no end problems will require manual restorals of trunks at the 
end where the problem did not exist. 


4.5 Maintenance control 


The maintenance control subsystem orchestrates all maintenance 
activity in the 1A vss periphery. The processor contains its own 
maintenance system. The maintenance control subsystem gets re- 
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quests to remove units from service, to diagnose units, restore units to 
service, and to switch duplex units. These requests come from the 
error control, system recovery, trunk maintenance and routine diag- 
nosis subsystems, and from manual inputs via the TTy from craft 
personnel. Maintenance control is responsible for prioritizing these 
requests and ensuring that no interference occurs between concurrent 
maintenance activities. 

Requests to remove units from service will be evaluated to determine 
the effect of this action. Removing a unit from service may affect 
several other units, or it could trigger particular alarm conditions or 
even system recovery action. Maintenance control must make these 
determinations and take appropriate action. One example of an inter- 
active condition is the removal of an smc from service, which will 
necessitate the removal of all disks connected to it, along with its 
associated second-stage switch (one of the modules in the two-stage 
TMSDS shown in Fig. 2). All three of these unit types are grouped into 
a family, and anything that affects one member of the family is 
evaluated to determine its affect on other family members. There are 
several other family groupings in the 1A vss. 

Another situation arises when the removal from service of an SMC 
would leave the system below a critical minimum number of opera- 
tional smcs (as detailed in the system recovery section). In this case, 
a routine remove request would be denied. 

Requests to diagnose units will cause these units to be removed from 
service if they are active, and then helper units will be selected as 
required before the request is passed along to the diagnostic control 
subsystem. Requests to restore a unit to service will first cause it to be 
removed from service if required, and then diagnosed. If the unit 
passes the diagnostic tests, it will be initialized and restored. A unit 
can also be unconditionally restored which will cause it to be initialized 
and put back into service without being diagnosed. When a unit is 
restored, a family evaluation will take place. For example, this occurs 
when all the disks connected to a particular voice message controller 
are taken out of service; then the storage media controller and its 
associated second stage switch are also taken out of service but are 
put in a nonfault stage. When any one of the disks is restored to 
service, the associated SMc and second-stage switch are also restored 
automatically. 


4.6 Diagnostics 


The 1A vss peripheral diagnostic programs are used to detect faults 
in their respective units; the resulting failure data are then used to 
identify any of the replaceable circuit modules which are faulty. A 
table-driven diagnostic design is used, and a high-level-macro approach 
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facilitates the diagnostic development. The same diagnostics are used 
for frame testing in the factory, for installation testing at the field site 
prior to cutover, and for on-line testing programs, while the system is 
operational. The diagnostics are triggered either manually, by craft 
personnel using a TTY, or automatically as was discussed earlier. The 
manually initiated diagnostics may be originated from either the on- 
site maintenance TTY or remotely from a Switching Control Center 
(scc). The trouble location capability is an integral part of the diag- 
_ nostics and will be described later. In general, the location capability 
is at its maximum when a unit contains a single fault; if it contains 
multiple faults, the location resolution is reduced. 

Each unit diagnostic is a collection of macros used to test the unit. 
These macros expand into data table words when assembled. Each 
macro expands into an OP-CODE or INDEX word, plus one or more 
data words. These data table words are grouped into one or more 
segments each containing less than 2K words. When a diagnostic is 
triggered automatically, all the segments in a particular unit diagnostic 
are run on the unit under test. If the diagnostic is manually triggered, 
all segments or selected segments can be run. A diagnostic segment 
can also run in an interactive mode where execution proceeds to a 
selected point in a segment, or proceeds from a selected point to 
another, or loops over one or more tests. 

Figure 4 shows the structure of the diagnostics. The data tables 
reside in auxiliary memory in the 1A vss disks. When executed, the 
segments are overlayed, one segment at a time, into a 2K paging buffer 
in main memory and are used to drive the diagnostic control program 
which resides in main memory. A simple model of this control program 
is included in Fig. 4. After overlaying a segment of data tables, a task 
dispenser examines the first data table word and extracts the OP- 
CODE or INDEX information. This is used to pass control to the 
appropriate task routine. Each macro type has a corresponding task 
routine which fetches the data words following the OP-CODE or 
INDEX word and executes accordingly. Most of the task routines 
build commands for the 1A vss peripheral devices and then send these 
commands to the appropriate device handlers. These handlers send 
the commands to the devices and they in turn cause the execution of 
one or more device-resident firmware routines. The result of this 
execution is sent back to the task routine via the device handler. If 
this sequence was triggered by a test-type macro, the task routine will 
compare the results with an expected result (supplied in the data 
words of the macro expansion) and will determine whether the test 
passed or failed. | 

Of special interest is the fact that the diagnostic structure resides in 
three different memory media. The data tables are in auxiliary mem- 
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Fig. 4—Diagnostic design. 


ory, the control is in main memory, and the work routines are in device 
memory (firmware). This table-driven, high-level-macro approach to 
designing diagnostics has been proven very successful in other ESS 
developments.’ It has permitted the diagnosticians to concentrate 
efficiently on the functional requirements of the hardware. 


4.7 Trouble location 


Trouble location uses all available information to deduce where a 
fault exists in the system. Once the resolution is made, the fault is 
corrected and the out-of-service unit is restored to service. Trouble 
location may require analyzing error conditions, plant measurements, 
maintenance reports, and any other indications of atypical system 
behavior. No attempt will be made here to describe all possible 
approaches. However, the great majority of all faults will be resolved 
by the unit diagnostics. 

The trouble location information is built into the diagnostic source. 
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A trouble location macro follows each test-type macro in a diagnostic 
segment. This macro requires the diagnostician to comment on what 
the test was checking for, to list the suspected replaceable circuit 
modules in a prioritized order, and to specify any special instructions 
required for the repair process. When the diagnostic source is assem- 
bled into data tables, these macros are expanded into readable repair 
information in the diagnostic listing. The listings are processed off-line 
where the trouble location information is extracted and used to form 
a separate Trouble Location Manual (TLM). When a diagnostic detects 
a test failure, it terminates at that point. Craft personnel can use the 
test number and segment number to find the appropriate diagnostic 
listing or the TLM where the trouble location information will be found. 


4.8 Routine diagnosis 

Periodically, the routine diagnosis subsystem will test the 1A vss in 
an attempt to provide automatic preventive maintenance: Periodic 
testing will detect a fault soon after it occurs, which reduces the 
probability of the occurrence of a second fault before the first one is 
repaired. This is desirable since, as previously mentioned, the power of 
the diagnostics is reduced if the unit under test contains multiple 
faults. Daily routine diagnosis will attempt to remove, diagnose, and © 
restore all the 1A vss units that are in service. If the diagnostic fails, 
the unit will be left out of service. 

There are several special considerations that the routine diagnosis 
subsystem must observe. It will run only during the intervals when the 
system has little or no customer load. It will cause the processors to 
switch only after first triggering a diagnosis of the processor that was 
the standby. It must take hardware family associations into account to 
minimize the effect on the rest of the system. 

Routine diagnosis also has the responsibility for performing certain 
test sequences that are not executed when the diagnostics are run as 
previously described. These test sequences can be run manually if the 
appropriate diagnostic parameters are used, but they seldom will be 
executed this way because of their long execution times. T'wo examples 
of such sequences are the multiple path select tests in the switch and 
matrix diagnostics, and the complete media test and initialization 
portion of the disk diagnostic. These sequences will be run periodically, 
but not on a daily basis. 


4.9 Power /alarm control 


The power/alarm subsystem monitors the system’s power and alarm 
indicators (with the exception of the carrier group alarms described in 
Section 4.4). Examples of these are unit power indicators, bus power 
alarms, and building alarms. The power/alarm software also provides 
routines for other maintenance software to trigger minor, major, and 
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critical alarms in the system. These power and alarm inputs will result 
in appropriate TTY output messages, audible and visual indications, 
and corresponding indications on the System Status Panel (ssp). All 
of these alarms will be sent to the connecting scc system using another 
port on the maintenance TTy channel and telemetry. An interesting 
feature of the 1A vss maintenance philosophy is that the power and 
alarm indicators which provide input to the power/alarm subsystem 
will not trigger any maintenance activity other than the indicators 
described above. The reasoning behind this is that all such power/ 
alarm problems will result in operational problems with the unit or 
units involved and will be handled by the error control subsystem. 

Most 1A vss peripheral units have power and status indicators built 
into the power switch module visible on the unit’s front panel. These 
indicators show if the unit is operating normally, is out of service, is 
powered down or is in a power alarm state. The indicators are con- 
trolled by the power/alarm subsystem. The status of the 1A vss units 
is also displayed on the ssp. If all the members of a particular unit 
type (e.g., SMCs) are in service, the corresponding light on the SsP is 
extinguished. If one or more units are out of service, the light is lit. 
Minor, major, or critical alarms result in output messages describing 
the problem, appropriate indicators on the ssp, audible alarms and 
aisle pilot lights indicating which aisle or equipment is experiencing 
the problem. If the 1A vss does not have resident craft personnel, the 
audible alarms can be shut down at the site and monitored only via 
remote connections to the scc system. 


4.10 Administrative maintenance function 


A great deal of effort was put into the man-machine-interface design 
of the administrative maintenance capability. This capability consists 
of: 

(t) Reporting features which allow the craft personnel to ask for 
the maintenance status of various system components via TTY input 
messages. 

(zi) Automatic outputs of maintenance information on an hourly 
basis. 

(zzi) Automatic outputs of detailed plant measurement data on a 
daily basis. 

(iv) Automatic outputs of a summary of the plant measurement 
information on a daily basis (used by the management in charge of the 
system). 

(v) Monthly measurements and summary records. 

The hourly maintenance information can be tailored by the individ- 
ual system managers. The default case is to report all available 
information. The entire report or selected categories can be printed 
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out or inhibited as desired. Examples of items comprising this report 
are a list of units out of service, entire trunk groups that are inoperative, 
and carrier facilities in the alarm state. The detailed plant measure- 
ments show which units had transient error conditions, which were 
automatically removed from service, which were found faulty as a 
result of the automatic removal, and the measurements also show the 
length of time each remained out of service. Other plant measurements 
indicate the frequency of system reinitializations and the levels at 
which the reinitializations occurred, and provide a calculation of the 
message reliability based on the Mean Time to Repair (MTTR) and the 
Mean Time Between Failure (MTBF) of the storage subsystem. 


V. INSTALLATION TESTING AND ACCEPTANCE TESTING 
5.1 Overview 


As the final phase of the installation, a new vss is put through an 
extensive series of tests to verify proper operation. Most of these tests 
result in TTY printouts of system actions that can serve as a permanent 
record of test results. In addition, a high-temperature test may be run 
at the option of the Boc. The Boc personnel either participate in these 
tests or subsequently review the results. At this point, the system is 
turned over to the Boc for final acceptance testing prior to cutting the 
system into service. 

Acceptance tests are run by a Boc to assure itself of the proper 
operation of a newly purchased system. It is highly likely that no two 
vsss will ever be exactly alike in terms of installed office configuration, 
trunking arrangements, connecting office number and types, and the 
mix of customers served by these offices. Nevertheless, the Boc has 
extensive standard test documentation and the expertise of the West- 
ern Electric installation organization to draw upon in tailoring testing 
to the specific vss office configuration. 

Acceptance tests may include: 

(zt) an audit of the installation tests, 

(zt) tests which check the interaction of the 1A vss with other 
systems to which it connects (e.g., ESS client offices and operational 
support systems). 

The installation tests are described in Section 5.3. In these tests, the 
system’s ability to handle a heavy calling volume is verified by the 
application of a simulated traffic load. The equipment which generates 
these test calls is described next. 


5.2 Call simulation equipment 


The equipment used to provide a simulated traffic load to 1A vss is 
different than that used in Ess offices because the requirements of 1A 
vss are unique. The processor on either an Electronic Switching 
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System (ESS) or on 1A vss can be presented a load by generating a 
large number of short-holding-time calls with computer-driven test 
facilities. In the Ess case, lengthening the holding time on calls would 
additionally load down the network. This is not generally done—it is 
considered unnecessary because networks are well understood and can 
be engineered to a highly predictable level of performance. Long calls 
would, incidentally, also load down the test facility; a much larger 
computer would be required to manage the much greater number of 
calls concurrently extant within the switching system. 

For 1A vss, both a high calling rate and long call holding times are 
required. It is true that the processor, which handles the call setup 
and tear-down, is loaded by a high calling rate. However, unlike the 
situation in an Ess, the 1A vss processor also has work to do during 
the course of a call to handle the storage and retrieval of messages. 
Furthermore, the SMCs and disks are loaded most heavily by lengthy 
messages which must be stored and retrieved. 

The corresponding throughput capacity requirement on the call 
simulation equipment is large. This is handled by a number of micro- 
processor-driven call simulators which provide a distributed load. In 
each VAC unit (of which there are two in each VAC frame) there is a 
normally unoccupied circuit module location having backplane wires 
to each Trunk Access Circuit (TAC). A call-simulator board may be 
plugged into this location to provide a call load on any or all of the 16- 
trunk circuits in this vAc. Simulated traffic is applied to selected 
trunks by the insertion of a strap plug (in lieu of an actual trunk 
connection) for each such trunk on the vAc backplane. Each call 
simulator can generate traffic autonomously, or can be monitored and 
controlled from a common facility. 

The call simulators are capable of generating any or all of five call 
types associated with the Call Answering service." They produce 
messages comprised of a pulsating tone that is encoded to contain a 
check number; message length may be set from 10 to 80 seconds in 10- 
second increments. Front panel switches (see Fig. 5) or external control 
may select the call type and message length parameters. Additionally, 
the E&M lead signalling is handled as is the generation of multifre- 
quency (MF) digit strings which contain call type and directory number 
information for the vss. Retrieved messages are decoded and matched 
for a correct check number. Counts of total calls and of errors in each 
of five categories are recorded by the microprocessor. They are also 
selectively displayed on a three (hexidecimal) digit read-out on the 
front panel. 

At the shortest (10-second) message length, a call simulator is 
capable of generating in the vicinity of 160 calls/hour/trunk depending, 
of course, on the response time of the vss. This substantially exceeds 
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anticipated per-trunk calling rates which are predicated on longer call 
holding times. Therefore, in doing volume tests, message lengths are 
chosen appropriately to obtain the calling rate for which an office has 
been engineered. 

A jack on the front panel provides a 300-baud RS 232 communication 
interface to a common monitor/control unit. This unit contains a 
minicomputer controller and a teletype for output. A maximum of 32 
call simulators, each located in a VAC unit, would be concurrently 
connected to this monitor/control unit in a maximum-size 1A vss. In 
the normal monitor mode, a full report is issued every 15 minutes. 
Others modes are available for hourly, daily, or on-demand reports. 
For each active trunk in the system, a printed line is produced that 
contains counts of total calls and of each of the five error types. Control 
and monitoring of all call simulators from this central point greatly 
facilitates the administration and recording of the integrated and 
maintenance volume tests, as discussed in the next section. 


5.3 Installation tests 


Various final checks performed by Western Electric installation are 
discussed in the following: 

(t) Idle System Tests are those in which the system is caused to 
sequence through the routine diagnostics which are normally run once 
a day. No traffic load is applied. This test verifies that all units and 
subsystems are operating properly. 

(11) Integrated Volume Tests involve a simulated traffic load being 
applied to the system. In this test, the 1A vss is required to carry the 
level of traffic (calls per hour) for which it was engineered, for a 24- 
hour period, during which full grade-of-service requirements must be 
met. 

(vit) Maintenance Volume Tests verify the system’s ability to re- 
cover properly from hardware troubles and software initializations. 
Simulated traffic is applied during the test. The grade of service 
delivered by the 1A vss during induced disturbances is monitored by 
the call simulation equipment and printed out on an associated TTY. 
The printout provides per-trunk counts of the number of calls handled 
properly, and of call-handling errors in different categories. The num- 
ber of such errors, if any, which are allowed depends upon the severity 
of the disruption induced in the system. For example, the manually 
initiated removal from and restoral to service of any units in the 1A 
vss should be accomplished without perturbation of call-handling 
activity. On the other hand, the disabling of an active controller will 
abort a call being set up at that instant, while a major software 
reinitialization could suspend call handling for a matter of seconds. A 
failure of commercial ac power is induced to verify the system’s ability 
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to transfer smoothly to battery-backup power. In general, the system 
design is such that the voltages supplied to vss operational equipment 
are not disturbed by such a failure. 

Also, various additional tests are run to cover interfaces or test 
equipment not checked by diagnostics (e.g., trunk test facilities). 


5.4 Acceptance tests 


During the interval between turnover and cutover of a system, the 
BOC performs acceptance tests to verify that the system will perform 
as engineered. Aside from the optional repetition of any of the instal- 
lation checks, these tests deal primarily with interfaces to other sys- 
tems. For vss this includes the connection to the automatic message 
accounting recording center (AMARC) system, which receives billing 
information; to EADAS (discussed in Section II); which collects traffic 
data; and to the scc for maintenance monitoring. However, the largest 
interface is made up of the trunk groups, which connect to Ess client 
offices. 

A complete test of vss features is made at the option of the Boc. 
Since the presently planned services and features can be thoroughly 
tested over the interconnecting trunk groups by several hundred phone 
calls, the development of special field test facilities has not been 
warranted. These intersystem tests are spelled out in a “script” which 
defines the specific actions (and their timing) to be taken in placing 
each call. A grouping of test telephones has been defined, each of 
which is provided with specific Ess features (e.g., one- or two-digit 
speed calling, call forwarding, call waiting, etc.). Appropriate calls 
placed on these phones can then verify that the combination of the 
ESS with its software, and the 1A vss with its software, are interacting 
correctly so as to produce the expected announcements or other 
responses. 


VI. SUMMARY 


A conventional telephone switching office mediates communication 
between two parties by setting up a connection over which they may 
converse. Unlike a switching office, the 1A Voice Storage System (1A 
vss) takes on the role of one of the conversing parties. In the course of 
handling a typical call, 1A vss receives input control signals at various 
times, outputs system announcements in response, and either records 
or retrieves customer messages. Since this type of communication is 
complex, offering many opportunities for a system malfunction to 
disrupt a call, the 1A vss design incorporates a large number of failure- 
defense strategies. 

The quality of service is of utmost importance. This article has 
presented the underlying philosophies and the approaches used to 
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obtain this quality. Several contributing factors which have been 
covered are adequate office ex gineering to assure sufficient equipment 
to handle peak traffic, high intrinsic reliability based upon system 
redundancies and defensive software strategies, and an effective main- 
tenance software system to minimize the effect of failures on system 
service. The incorporation of these system features has provided a 
robust 1A vss system which is flexible and reliable. 
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ACRONYMS AND ABBREVIATIONS FOR VOICE 
STORAGE SYSTEM 


AC 

ADM 

AGC 

AMARC 

AP 

ATAE 

BOC 

CA 
CAROT/ROTL 


CAS 
CCD 
CCS I 
CF 
CGA 
CODEC 
CRC 
CSD 
CW 
DCA 


MISC. 
MTBF 
MTC 
MTTR 
MWR 


advance calling 

adaptive delta modulation 

automatic gain control 

automatic message accounting recording center 

auxiliary 3A processor 

Associated Telephone Answering Exchanges, Inc. 

Bell Operating Company 

call answering 

centralized automatic reporting on trunks/remote 
office test line 

custom announcement service 

charge-coupled device 

custom calling services II 

call forwarding 

carrier group alarm 

coder/decoder 

cyclic redundancy check (or code) 

customer specified delivery 

call waiting 

daily call answering 

direct memory access 

directory number 

disk transport 

dual tone multifrequency 

engineering and administrative data acquisition 
system 

extended operating system 

electronic switching system 

Federal Communication Commission 

first-in-first-out 

frequency shift keying 

input/output processor 

large-scale integration 

monthly call answering 

multifrequency 

miscellaneous 

mean time between failures 

maintenance frame 

mean time to repair 

message waiting ring 
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MWT 
NAC 
OSS 
PC 
PCC 
PCD 
POTS 
PUC 
RA 
RAM 
RC 
SCAM 
SCC 
SMC 
SO 
SO TTY 
SPC 
SRC 
SSD 
SSP 
STC 


TAC 
TLM 
TMSDS 
TNDS 
TSPS 
TST 
TTY 
UPS 
VAC 


VSS 
VM 
VU 


message waiting tone 
network administrative center 
operations support system 
privacy code 

privacy code change 

power control and distribution frame 
“plain old telephone service” 
Public Utility Commission 
remote access 

random-access memory 
resistance-capacitance 
service circuit access matrix 
switching control center 
storage media controller 
service order 

service order teletypewriter 
stored program control 
service circuit 

service specified delivery 
system status panel 

state table controller 
TRIPORT cabinet 

trunk access circuit 

trouble location manual 

time multiplexed space division switch 
total network data system 
traffic service position system 
test frame 

teletypewriter 
uninterruptible power supply 
voice access circuit 

voice presence 

voice storage system 

voice manager 

volume unit 
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B.S.T.J. BRIEF 


A Josephson Parallel Multiplier 


By T. A. FULTON and L. N. DUNKLEBERGER 
(Manuscript received January 14, 1982) 


This report describes the operation of an 8- x 12-bit parallel multi- 
plier which employs Josephson tunnel junctions (Fig. 1).1* The device 
contains 548 junctions used in two-junction “Jaws’-type logic ele- 
ments.” Ninety-six of these logic elements function as AND gates to 
form the partial products. Their outputs are fed into a Wallace-tree® 
arrangement of 89 full adders, each having one Jaws for the CARRY 
computation and one for the SUM. The thirteen most significant bits 
of the product are returned to the outputs. 

A latching-logic mode of operation is used.’ This employs a five- 
phase ac current supply for power and timing, provided by unipolar 
pulses from a room-temperature word generator. The minimum cycle 
time achieved is 75 ns, with a latency time delay between input and 
output of 30 ns measured at the room-temperature connectors. Both 
times are within design specifications. The latency time is determined 
primarily by the time required for settling of the power-supply pulse 
amplitudes (20 ns total) and by the round-trip cable delay (8 ns) from 
the room-temperature connectors to the chip immersed in liquid He. 
The worst-case logic delay for the Jaws elements (ripple carry through 
18 stages) is estimated as <2 ns, based on the <100 ps computer- 
simulated Jaws delay. In initial tests after preliminary adjustment of 
bias levels (chiefly the five-pulse amplitudes from the word generator), 
the multiplier operates correctly on all of several hundred selected 
combinations of inputs, as judged by oscilloscope monitors. Both 
exhaustive testing and optimization of bias levels remain to be carried 
out. 


* (See Ref. 1 for two recent publications devoted to reviews of the state of the art in 
Josephson digital circuits.) 
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Fig. 1—A photomicrograph of the multiplier chip. 


Circuits were fabricated on 2-inch-diameter oxidized Si wafer sub- 
strates having 12 chips per wafer. The circuit has seven levels. These 
are the Nb ground plane, two levels of ground-plane insulation (an- 
odized Nb and an evaporated Ge-SiO sandwich), the Cu-Ge resistors, 
the Pb-alloy y-direction wiring and junction-base electrodes, the Ge- 
SiO crossover insulation, and the Pb-alloy x-direction wiring and 
junction-upper electrodes. Linewidths of 10 um are used predominantly 
in the wiring and junctions, while some resistors are of 2.5-um line- 
width. 

Fabrication was performed in a small class-1000 clean-room facility. 
Failures were caused primarily by lithographic or other fabrication- 
related defects and, secondarily, by nonuniformities in junction-critical 
currents. Estimates for the yield for defect-free fabrication are roughly 
25 percent. 

The very-high-speed capabilities of Josephson junctions are not well 
utilized in the particular design chosen for the multiplier. It was 
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decided to emphasize simplicity at the expense of performance (within 
the speed specifications mentioned previously) to minimize fabrication 
difficulties. Nevertheless, to our knowledge the circuit is the most 
complex, fully working Josephson circuit described to date. 

We wish to acknowledge substantial contributions to this work by J. 
N. Hollenhorst, S. 8S. Pei, and 8S. K. Tewksbury. We have benefited by 
continued support from R. E. Slusher and J. A. Giordmaine. 


REFERENCES 


1. IBM J. of Res. and Dev. 24 (March 1980) and IEKE Trans. Electron Devices, ED-27 
(October 1980). 

2. T. A. Fulton, L. N. Dunkleberger, and S. S. Pei, “A Simple, High-Performance 
Current-Switched Josephson Logic Gate,’ Appl. Phys. Lett., 34 (1979), pp. 
709-11. 

3. C.S. Wallace, “A Suggestion for a Fast Multiplier,” IFEE Trans., EC-13 (1964), pp. 
14-7. 


JOSEPHSON PARALLEL MULTIPLIER 933 








THE BELL SYSTEM TECHNICAL JOURNAL is abstracted or indexed by Abstract 
Journal in Earthquake Engineering, Applied Mechanics Review, Applied Science & 
Fechnology Index, Chemical Abstracts, Computer Abstracts, Current Contents / 
Engineering, Technology & Applied Sciences, Current Index to Statistics, Current 
Papers in Electrical & Electronic Engineering, Current Papers on Computers & 
Control, Electronics & Communications Abstracts Journal, The Engineering Index, 
International Aerospace Abstracts, Journal of Current Laser Abstracts, Language and 
Language Behavior Abstracts, Mathematical Reviews, Science Abstracts (Series A, 
Physics Abstracts; Series B, Electrical and Electronic Abstracts; and Series C, 
Computer & Control Abstracts), Science Citation Index, Sociological Abstracts, 
Social Welfare, Social Planning and Social Development, and Solid State Abstracts 
Journal, Reproductions of the Journal by years are available in microform from 
University Microfilms, 300 N. Zeeb Road, Ann Arbor, Michigan 48106. 


CONTENTS (continued) 


Software 
G. W. Gates, R. F. Kranzmann, and L. D. Whitehead 


Ofiice Engineering, Maintenance, and Reliability 
P. W. Bowman, J. C. Kennedy, and G. A. Van Dine 


Acronyms and Abbreviations 


CONTRIBUTORS TO THIS ISSUE 
PAPERS BY BELL LABORATORIES AUTHORS 
CONTENTS, JULY-AUGUST ISSUE 

B.S.T.J. Brief: 


A Josephson Parallel Multiplier 
T. A. Fulton and L. N. Dunkleberger 


@) Bell System 


863 


885 


913 


915 
923 
929 


931 


